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GENBANK.RTM. COPYRIGHT 2001 



LOCUS (LOC) : 
GenBank ACC. NO. (GBN) 
CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL) : 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE) : 
DEFINITION (DEF) : 

KEYWORDS (ST) : 
SOURCE : 
ORGANISM (ORGN) : 



BF509252 GenBank (R) 

BF509252 

307875-04-3 

791 

rtiRNA; linear 
Expressed sequence tag 
6 Dec 2000 

UI-H-BI4-aow-c-07-0-UI.sl NCI_CGAP_Sub8 Homo sapiens 
cDNA clone IMAGE : 308 6220 3\ mRNA sequence. 
EST 
human . 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi; Mammalia; Eutheria; Primates; 



Catarrhini; 



(NA) 



Hominidae; Homo 
: 270 a 151 c 



144 g 226 t 



NUCLEIC ACID COUNT 
COMMENT : 

Contact: Robert Strausberg, Ph.D. 
Email : cgapbs-r(§mail . nih . gov 

The sequence contained an oligo-dT track that was present in the 
oligonucleotide that was used to prime the synthesis of first 
strand cDNA and therefore this may represent a bonafide poly A 
tail. cDNA Library Preparation: M.S. Scares Lab Clone distribution: 
NCI-CGAP clone distribution information can be found through the 
I.M.A.G.E. Consortium/LLNL at: 
www-bio . llnl , gov/bbrp/ image/image . html 
Seq primer: M13 Forward 
POLYA=Yes. 

1 (bases 1 to 791) 

NCI -CGAP ht tp : / /www . ncbi . nlm . nih . gov/ncicgap . 
National Cancer Institute, Cancer Genome Anatomy 
Project (CGAP), Tumor Gene Index 
Unpublished (1997) 



REFERENCE : 

AUTHOR (AU) 
TITLE (TI) : 



JOURNAL (SO) : 



FEATURES (FEAT) : 

Feature Key 
===============+== 

source 1 , 



Location 
.791 



Qualifier 



/organism="Homo sapiens" 
/db-xref="taxon: 9606" 
/clone="IMAGE: 3086220" 
/clone-lib="NCI-CGAP-Sub8" 
/lab-host="DH10B (Life 
Technologies ) " 
/note="Vector : pT7T3D-Pac 
(Pharmacia) with a modified 
polylinker; Site-1: Not I; Site-2 : 
Eco RI; NCI-CGAP-Sub8 is a 
subtracted library derived from 
NCI-CGAP-Sub5. The NCI-CGAP-Sub8 
library had 2.5 million 
recombinants. A single-stranded 
DNA preparation of NCI-CGAP-Sub5 
was used as a tracer in a 
subtractive hybridization with a 
driver comprising: a pool of 
clones from NCI-CGAP-Sub5 (IMAGE 



clone Ids 2732833-2737415, 
3068040-3069191; 25% of the driver 
population) , a pool of clones from 
NCI-CGAP-Sub4 (IMAGE clone Ids 
2723592-2729326; 25% of the driver 
population) , NCI-CGAP-Sub6 (pool 
AIF-AJU, IMAGE Ids 

2728969-2733190; 25% of the driver 
population) , and NCI-CGAP-Sub7 
(IMAGE Ids 3069192-3072238, 
3081864-3084550 ; 25% of the 
driver population) . Subtraction 
was performed as previously 
described [Bonaldo, Lennon & 
Soares (1996) : Normalization and 
Subtraction: Two Approaches To 
Facilitate Gene Discovery. Genome 
Research 6, 791-806. 
TAG-LIB=NCI-CGAP-Lu5 
TAG-TISSUE=lung TAG-SEQ=CAAC" 



SEQUENCE (SEQ) : 

1 tttttttttt 

61 gctatcaacg 

121 ctttcccgtt 

181 cctgtaaagg 

241 cgggagtatc 

301 tatgttgttc 

361 gatattgaac 

421 gctaaaaatg 

481 acaaaagatg 

541 gatgttgcca 

601 gacccggtac 

661 attatctgcc 

721 tctaggagct 

781 atctcgtgcc 



tttgttacca 
atgaatgcta 
tacggcttcc 
agagaagagc 
tgaagcagca 
catatacaat 
aacttaaaga 
aaaataacag 
cccaaggacc 
tgaatatcat 
taccagctcg 
tcctgaaatg 
gttaacaagc 

g 



tgaaatcatc 
tcaagtaaga 
acttgagtat 
tcatgctagg 
tgcagctgtt 
tcaccttttg 
tgttaaagaa 
tcacgctttt 
agatgatgca 
catgtcaaag 
tttcttcact 
aaatcatttt 
actttcatca 



acattagaac 
caagtgtttg 
atggcaatct 
caatgtttgg 
agtgaaaaat 
gcacatgacc 
tgtctttggt 
atcagaaaga 
aaaatgaatg 
agtactacat 
caacctgaca 
tcactcctgg 
gcaggcaagc 



aatatcagct 
cccagaaact 
gtgccctttg 
tgaaaaatat 
tattgtctct 
cagattatgt 
ttgttctgga 
tggtagaaaa 
aaaaactgta 
acagtttgga 
agatttcagt 
aaaacctaaa 
atctcagaca 



atgtgcatta 
tcacaaaggc 
tgcaaaagat 
aaatgtaagg 
tctaccagag 
caaagtacag 
aatattaatg 
tattaaacaa 
cactgtgtgt 
atctcctaaa 
aacaccaaaa 
acaacaatgt 
gatcatcaga 



=> 
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GENBANK.RTM, COPYRIGHT 2001 



LOCUS (LOG) : HSU95825 
GenBank AGC. NO. (GEN): U95825 
CAS REGISTRY NO. (RN) : 225494-81-5 
SEQUENCE LENGTH (SQL) 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE) : 
DEFINITION (DEF) : 



GenBank (R) 



SOURCE: 
ORGANISM (ORGN) : 



Catarrhini; 



5253 

mRNA; linear 
Primates 
5 Apr 1999 

Human androgen-induced prostate proliferative shutoff 
associated protein (ASS) mRNA, complete cds . 
human . 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi; Mammalia; Eutheria; Primates; 



Hominidae; Homo 
NUCLEIC ACID COUNT (NA) : 1755 a 944 c 1074 g 1480 t 
COMMENT : 

On Apr 5, 1999 this sequence version replaced gi: 4539617. 



REFERENCE: 

AUTHOR (AU) : 

TITLE (TI) : 

JOURNAL (SO) 

REFERENCE: 

AUTHOR (AU) : 

TITLE (TI) : 



JOURNAL (SO) : 
OTHER SOURCE (OS) 
REFERENCE: 

AUTHOR (AU) : 

TITLE (TI) : 
JOURNAL (SO) : 
Tufts 

Boston, 

REFERENCE: 

AUTHOR (AU) : 

TITLE (TI) : 
JOURNAL (SO) : 
Tufts 

Boston, 



1 (bases 1 to 5253) 

Geek, P.; S2elei,J.; Jimenez, J.; Soto, A.M.; 
Sonnenschein, C. 

Androgen-induced proliferative shutoff in prostate 
cancer cells 

Proc. Annu. Meet. Am. Assoc. Cancer Res., 37, 223-223' 
(1996) 

2 (bases 1 to 5253) 

Geek, P.; Szelei,J.; Jimenez, J.; Sonnenschein, C . ; 
Soto, A.M. 

Early gene expression during androgen-induced 
inhibition of proliferation of prostate cancer cells: 

new suppressor candidate on chromosome 13, in the 
BRCA2-Rbl locus 

J. Steroid Biochem. Mol. Biol., 68 (1-2), 41-50 (1999) 
CA 131:3561 

3 (bases 1 to 5253) 

Geek, P.; Szelei,J.; Jimenez, J.; Sonnenschein, C . ; 

Soto, A.M. 

Direct Submission 

Submitted ( 28-MAR-1997 ) Anatomy and Cell Biology, 
University Medical School, 136 Harrison Avenue, 

MA 02111, USA 

4 (bases 1 to 5253) 

Geek, P.; Szelei,J.; Jimenez, J.; Sonnenschein, C . ; 

Soto, A.M. 

Direct Submission 

Submitted ( 05-APR-1999) Anatomy and Cell Biology, 
University Medical School, 136 Harrison Avenue, 
MA 02111, USA 



FEATURES (FEAT) : 

Feature Key Location 



Qualifier 



/organism="Homo sapiens" 

/db-xref ="taxon : 9606" 

/ chromos ome= "13" 

/map="13ql2-ql3" 

/cell-line="LNCaP" 

/cell-type="cancer cells" 

/ t issue- type= "prostate" 

/note="exon 1 maps to PI 

Artificial Chromosome PAC26H23; 

exons 2-24 map to cosmid 267pl9; 

axons 25-34 map to PI Artificial 

Chromosome PAC 4 9J10" 

/gene="AS3" 

/gene="AS3" 

/ number=l 

/gene="AS3" 

/number=2 

/gene="AS3" 

/codon-start=l 

/product="androgen-induced 

prostate proliferative shutoff 

associated protein" 

/protein-id="AAD22134 .2" 

/db-xref="GI:4 559410" 

/translation="MAHSKTRTNDGKITYPPGVK 

E I S DKI S KEEMVRRLKMVVKT FMD 

MDQDSEEEKELYLNLALHLASDFFLKHPDKDVRL 

LVACCLADIFRIYAPEAPYTSPDK 

LKDIFMFITRQLKGLEDTKSPQFNRYFYLLENIA 

WVKSYNICFELEDSNEIFTQLYRT 

LFSVINNGHNQKVHMHMVDLMSSIICEGDTVSQE 

LLDTVLVNLVPAHKNLNKQAYDLA 

KALLKRTAQAI E P Y I TN FFNQVLMLGKT S I S DLS 

EHVSDLILELYNIDSHLLLSVLPQ 

LEFKLKSNDNEERLQVVKLLAKMFGAKDSELASQ 

NKPLWQCYLGRFNDIHVPIRLECV 

KFASHCLMNHPDLAKDLTEYLKVRSHDPEEAIRH 

DVIVSIVTAAKKDILLVNDHLLNF 

VGERTLDKRWRVRKEAMMGLAQIYKKYALQSAAG 

KDAAKQIAWIKDKLLHIYYQNSID 

DRLLVERIFAQYMVPHNLETTERMKCLYYLYATL 

DLNAVKALNEMWKCQNLLRHQVKD 

LLDLIKQPKTDASVKAIFSKVMVITRSLPDPGKA 

QDFMKKFTQVLEDDEKIRKQLEVL 

VSPTCSCKQAEGCVREITKKLGNPKQPTNPFLEM 

IKFLLERIAPVHIDTESISALIKQ 

VNKSIDGTADDEDEGVPTDQAIRAGLELLKVLSF 

THPISFHSAETFESLLACLKMDDE 

KVAEAALQIFKNTGSKIEEDFPHIRSALLPVLHH 

KSKKGPPRQAKYAIHCIHAIFSSK 

EAQFAQIFEPLHKSLDPSNLEHLITPLVTIGHIA 

LLAPDQFAAPLKSLVATFIVKDLL 

MNDRLPGKKTTKLWVPDEEVSPETMVKIQAIKMM 

VRWLLGMKNNHSKSGTSTLRLLTT 

ILHSDGDLTEQGKISKPDMSRLRLAAGSAIVKLA 

QEPCYHEIITLEQYQLCALAINDE 

CYQVRQVFAQKLHKGLSRLRLPLEYMAICALCAK 

DPVKERRAHARQCLVKNINVRREY 

LKQHAAVSEKLLSLLPEYVVPYTIHLLAHDPDYV 

KVQDIEQLKDVKECLWFVLEILMA 



KNENNSHAFIRKMVENIKQTKDAQGPDDAKMNEK 

LYTVCDVAMNIIMSKSTTYSLESP 

KDPVLPARFFTQPDKNFSNTKNYLPPGMKSFFTP 

GKPKTTNVLGAVNKPLSSAGKQSQ 

TKSSRMETVGNASSSSNPSSPGRIKGRLDSSEMD 

HSENEDYTMSSPLPGKRSDKRDDS 

DLVRSELEKPRGRKKTPVTEREEKLGMDDLTKLV 

QEQRPKGSQRSRKRGHTASESDEQ 

QWPEEKRLKEDILENEDEQNSPPKKGKRGRPPKP 

LGGGTPKEEPTMKTSKKGSKKKSG 

PPAPEEEEEEERQSGNTEQKSKSKQHRVSRRAQQ 

RAESSESSAIESTQSTPQKGRGRP 

SKTPSPSQPKKNV" 



exon 


174 . , 


.377 


/gene="AS3" 
/number-3 


exon 


378 . , 


.464 


/gene-"AS3" 
/number=4 


exon 


4'65. . 


.562 


/gene="AS3" 
/number=5 


exon 


563. . 


.689 


/gene="AS3" 
/number^6 


exon 


690. . 


.770 


/gene="AS3" 
/nuniber=7 


exon 


771. , 


.911 


/gene="AS3" 
/number=8 


exon 


912. . 


.1027 


/gene="AS3" 
/nuniber=9 


exon 


1028, 


. .1122 


/gene="AS3" 
/number=10 


exon 


1123. 


. .1268 


/gene="AS3" 
/number=ll 


exon 


1269. 


. .1420 


/gene-"AS3" 
/number=12 


exon 


1421. 


. . 1534 


/gene="AS3" 
/number=13 


exon 


1535. 


. .1616 


/gene="AS3" 
/number=14 


exon 


1617. 


. .1665 


/gene="AS3" 
/number=15 


exon 


1666 . 


. .1805 


/gene= AS3 
/number=16 


exon 


1806. 


. .1921 


/gene="AS3" 
/number=17 


exon 


1922. 


. .2027 


/gene="AS3" 
/number=18 


exon 


2028. 


. .2188 


/gene="AS3" 
/number=19 


exon 


2189. 


. .2312 


/gene="AS3" 
/number=20 


exon 


2313. 


. . 2471 


/gene="AS3" 
/nurriber=21 


exon 


2472. 


. .2540 


/gene="AS3" 
/number=22 


exon 


2541. 


. .2677 


/gene="AS3" 
/number=23 


exon 


2678. 


. .2801 


/gene="AS3" 
/nurtiber=24 


exon 


2802. 


. .3006 


/gene="AS3" 
/number=25 


exon 


3007. 


. .3121 


/gene="AS3" 



exon 


3122. 


.3254 


exon 


3255. 


.3374 


exon 


3375. 


.3437 


exon 


3438 . 


. 3583 


exon 


3584. 


.3689 


exon 


3690. 


.4129 


exon 


4130. 


.4354 


exon 


4355. 


.4810 



/number=2 6 

/gene="AS3" 

/number=27 

/gene="AS3" 

/number=28 

/gene="AS3" 

/number=29 

/gene="AS3" 

/number=30 

/gene="AS3" 

/nuitiber=31 

/gene="AS3" 

/number=32 

/gene="AS3" 

/nuiTiber=33 

/gene="AS3" 

/number=34 



SEQUENCE (SEQ) : 

1 ccggagagcc 

61 ctgtcatggc 

121 tcaaggaaat 

181 tgaaaacttt 

241 tagctttaca 

301 tggtagcctg 

361 cccctgataa 

421 atacaaagag 

481 agtcatataa 

541 gaaccttatt 

601 accttatgag 

661 ttttagtaaa 

721 aggctttact 

781 aggttctgat 

841 ttttggagct 

901 ttaaattaaa 

961 tgtttggggc 

1021 tgggcaggtt 

1081 attgtctcat 

1141 cacatgaccc 

1201 aaaaggatat 

1261 acaaacgatg 

1321 atgctttaca 

1381 aattgctaca 

1441 ttgctcaata 

1501 acttgtatgc 

1561 aaaatctgct 

1621 atgccagtgt 

1681 ctggtaaggc 

1741 taagaaagca 

1801 gtgtgcgtga 

1861 aaatgatcaa 

1921 gtgctcttat 

1981 gtgttccaac 

2041 cacatcccat 

2101 tggatgatga 

2161 ttgaagagga 

2221 aaaaaggacc 

2281 gtaaagaggc 

2341 acctggaaca 

2401 atcaatttgc 



ccggagtgag 
tcattcaaag 
atcagataaa 
tatggatatg 
tcttgcttca 
ctgccttgct 
actaaaggat 
cccacagttc 
catatgcttt 
ttcagttata 
ctctattatt 
tctggtacct 
gaagaggaca 
gcttgggaaa 
ctacaatatt 
gagcaatgat 
aaaggattca 
taatgatatc 
gaaccatcct 
tgaggaagct 
tcttctggtc 
gagagtacgc 
gtcagcagct 
tatatattat 
catggttcct 
cacactggat 
ccgacatcaa 
caaggccata 
tcaggatttc 
gttagaagta 
aataactaag 
gtttctcttg 
taaacaagtg 
tgatcaagcc 
ctcatttcat 
aaaagtagca 
ttttccacac 
cccccgtcaa 
ccagtttgca 
tctcataaca 
tgctcctttg 



cggagtagcg 
actaggacca 
atatctaaag 
gaccaggact 
gatttttttc 
gatattttca 
atatttatgt 
aataggtatt 
gagttagaag 
aacaatggcc 
tgtgaaggtg 
gctcataaga 
gctcaagcta 
acatctatca 
gatagtcatt 
aatgaggagc 
gaattggctt 
catgtaccaa 
gatttagcaa 
attagacatg 
aatgatcact 
aaagaagcca 
ggaaaagatg 
caaaatagta 
cacaatttag 
ttaaatgctg 
gtaaaggatt 
ttttcaaaag 
atgaagaaat 
cttgttagtc 
aagttgggca 
gagaggatag 
aacaaatcaa 
atcagagcag 
tctgctgaaa 
gaagctgcac 
atcagatcag 
gccaaatatg 
cagatatttg 
ccattggtta 
aaatctttgg 



agtcggcaac 
atgatggaaa 
aggagatggt 
ctgaagaaga 
tcaagcatcc 
ggatttatgc 
ttataacaag 
tttatttgct 
atagcaatga 
acaatcagaa 
atacagtgtc 
atttaaacaa 
ttgagccata 
gcgatttgtc 
tgctgctctc 
gcctacaagt 
ctcaaaacaa 
tccgcctgga 
aagacttaac 
atgttattgt 
tacttaattt 
tgatgggact 
ctgcaaaaca 
ttgatgatcg 
aaactacaga 
tgaaagcatt 
tgcttgactt 
tgatggttat 
tcacacaggt 
caacatgctc 
accccaaaca 
cacctgtgca 
tagatggaac 
gtcttgaact 
catttgaatc 
tacaaatttt 
ccttgcttcc 
ccattcattg 
agcctctgca 
ctattggtca 
tagctacttt 



ccggaggggt 
aattacatat 
gagacgatta 
aaaggagctt 
tgataaagat 
tcctgaagct 
acagttgaag 
tgagaacatt 
aattttcacc 
agtccatatg 
tcaggagctt 
gcaagcatat 
tattaccaat 
agagcatgtc 
tgttttaccc 
tgttaaacta 
gccactttgg 
atgtgtgaaa 
agagtatctt 
gtcaatagtt 
tgtgggagag 
tgcccaaatt 
gatagcatgg 
actacttgtt 
acggatgaaa 
gaatgaaatg 
gattaagcaa 
tacaagaagt 
gttagaagat 
ctgcaagcag 
gcctacaaat 
catagatacc 
agcagatgat 
gcttaaggta 
attactggcc 
caaaaacaca 
tgttttacat 
tatccatgcg 
taagagccta 
tattgctctc 
cattgtgaaa 



agaaatattt 
ccgcctgggg 
aagatggttg 
tatttaaacc 
gttcgcttac 
ccttacacat 
gggctagagg 
gcttgggtca 
cagctataca 
cacatggtag 
ttggatacgg 
gatttggcaa 
ttttttaatc 
tctgacttaa 
cagcttgaat 
ctggcgaaaa 
cagtgctact 
tttgctagcc 
aaagtgaggt 
acagctgcta 
agaacattag 
tataagaaat 
atcaaagaca 
gaacggatct 
tgcttatatt 
tggaaatgtc 
cccaaaacag 
ttacctgatc 
gatgagaaaa 
gctgaaggtt 
cctttcctgg 
gaatctatca 
gaagatgagg 
ctctcattta 
tgtctgaaaa 
ggaagcaaaa 
cacaaatcta 
atattttcta 
gatccaagca 
cttgcacctg 
gatcttctca 



24 61 tgaatgatcg gcttccaggg aaaaagacaa ctaaactttg ggttccagat gaagaagtat 
2521 ctcctgagac aatggtcaaa attcaggcta ttaaaatgat ggttcgatgg ctacttggaa 
2581 tgaaaaataa tcacagtaaa tcaggaactt ctaccttaag attgctaaca acaatattgc 
2641 atagtgatgg agacttgaca gaacagggga aaattagtaa accagatatg tcacgtctga 
2701 gacttgctgc tgggagtgct attgtgaagc tggcacaaga accctgttac catgaaatca 
2761 tcacattaga acaatatcag ctatgtgcat tagctatcaa cgatgaatgc tatcaagtaa 
2821 gacaagtgtt tgcccagaaa cttcacaaag gcctttcccg tttacggctt ccacttgagt 
2881 atatggcaat ctgtgccctt tgtgcaaaag atcctgtaaa ggagagaaga gctcatgcta 
2941 ggcaatgttt ggtgaaaaat ataaatgtaa ggcgggagta tctgaagcag catgcagctg 
3001 ttagtgaaaa attattgtct cttctaccag agtatgttgt tccatataca attcaccttt 
3061 tggcacatga cccagattat gtcaaagtac aggatattga acaacttaaa gatgttaaag 
3121 aatgtctttg gtttgttctg gaaatattaa tggctaaaaa tgaaaataac agtcacgctt 
3181 ttatcagaaa gatggtagaa aatattaagc aaacaaaaga tgcccaagga ccagatgatg 
3241 caaaaatgaa tgaaaaactg tacactgtgt gtgatgttgc catgaatatc atcatgtcaa 
3301 agagtactac atacagtttg gaatctccta aagacccggt actaccagct cgtttcttca 
3361 ctcaacctga caagaatttc agtaacacca aaaattatct gcctcctgga atgaaatcat 
3421 ttttcactcc tggaaaacct aaaacaacca atgttctagg agctgttaac aagccacttt 
34 81 catcagcagg caagcaatct cagaccaaat catcacgaat ggaaactgta ggcaatgcaa 
3541 gcagcagctc aaatccaagc tctcctggaa gaataaaggg gaggcttgat agttctgaaa 
3601 tggatcacag tgaaaatgaa gattacacaa tgtcttcacc tttgccgggg aaaagaagtg 
3661 acaagagaga cgactctgat cttgtaaggt ctgaattgga gaagcctaga ggcagaaaaa 
3721 aaacgcccgt cacagaacgg gaggagaaat taggtatgga tgacttgact aagttggtac 
3781 aggaacagag acctaaaggc agtcagcgaa gtcggaaaag aggccatacg gcttcagaat 
3841 ctgatgaaca gcagtggcct gaggaaaaga ggctcaaaga agatatatta gaaaatgaag 
3901 atgaacagaa tagtccgcca aaaaagggta aaagaggccg accaccaaaa cctcttggtg 
3961 gaggtacacc aaaagaagag ccaacaatga aaacttctaa aaaaggaagc aaaaaaaaat 
4021 ctggacctcc agcaccagag gaggaggaag aagaagaaag acaaagtgga aatacggaac 
4081 agaagtccaa aagcaaacag caccgagtgt caaggagagc acagcagaga gcagaatctt 
4141 ctgaatctag tgcaattgaa tccacacagt ccacaccaca gaaaggacga ggaagaccat 
4201 caaaaacgcc atcaccatca caaccaaaaa aaaatgtgta agttgtaaat attacatttc 
4261 aaaccaattt caaattattt tgcaaaagtt cctaaatttg taaacataca tattgctgta 
4321 tttaaattcc atatatttag ccccattaca ctaggtacgg cggcgaagtg ctaaaaggga 
4381 acggcgatga acaaatgtaa ttaataactt tctctgtgaa agctttggaa aaatcttttt 
4441 tttttttttt tttttttggt caagcttgag gctgaataaa gcctttgatg cacaaaatgg 
4501 gactgctgaa gagtggacag ttggacctta ctttggtgac cccatacatt tgtggtcaca 
4561 tgctttagcc atacacatgg taacattgac tatggagtct tgtgaaagtg taatgtgcga 
4 621 tggctatgta gacataaaga agaaacttgt aaatatcttt tttctttttt ttaatgtttc 
4 681 tgatttctga agtgcttgta tagcttttat ctgcggcttt aaactgacag tacccgactg 
4741 tttattggat ctattgattt gaaaagaatt tgttaggata gatcttaagc agtaatctgt 
4801 cagtgtttgt atttgtattt tctgcaattt tactgtgaaa aaaaatttgt tttcaacaat 
4861 tggtgtcatt ttcttgatgt cactatttgt tggagagtta aatggtctct tccctttgtg 
4921 tatcttacct agtgtttact cctgggcacc cttaatcttc agaggtgcta aattgtctgc 
4981 cattacacca gaaggatgcc tctgatagga ggacaaccat gcaaattgtg aaatagtcct 
5041 gaagttcttg gattacttta cacctcagta ttgatttgtc ccagaatttt ctggcctttc 
5101 atggcaatga aaattttaag aagaaagatt taaagtattt taattttaaa gagtgtgtta 
5161 taaaataatg tactgaattc tttatcccat tttatcatcc tttcagtttt tattaatcta 
5221 ctgtatcaat aaaattctgt aatttgaatg agt 
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LOCUS (LOC) : AC068224 GenBank (R) 

GenBank ACC. NO. (GBN) : AC068224 
CAS REGISTRY NO. (RN) : 263690-85-3 
SEQUENCE LENGTH (SQL) : 54398 
MOLECULE TYPE (CI): DNA; linear 

DIVISION CODE (CI) : High Throughput Genome Sequencing 

DATE (DATE) : 30 Apr 2000 

DEFINITION (DEF) : Homo sapiens chromosome 3 clone RP11-660H19 map 3, 

LOW-PASS SEQUENCE SAMPLING. 
KEYWORDS (ST) : HTG; HTGS_PHASE0 

SOURCE : human . 

ORGANISM (ORGN) : Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 

Euteleostomi; Mammalia; Eutheria; Primates; 

Catarrhini; 

Hominidae; Homo 

NUCLEIC ACID COUNT (NA) : 13631 a 10245 c 8645 g 14936 t 6941 others 
COMMENT : 

All repeats were identified using RepeatMasker : 
Smit, A.F.A. & Green, P. (1996-1997) 

http : / /ftp . genome . Washington . edu/RM/RepeatMasker . html 
Genome Center 

Center; Whitehead Institute/ MIT Center for Genome Research 

Center code: WIBR 

Web site: http://www-seq.wi.mit.edu 

Contact : sequence_submissions@genome . wi .mit . edu 

Project Information 

Center project name: L10161 
Center clone name: 660 H 19 



NOTE: This record contains 68 individual 
sequencing reads that have not been assembled into 
contigs. Runs of N are used to separate the reads 
and the order in which they appear is completely 
arbitrary. Low-pass sequence sampling is useful for 
identifying clones that may be gene-rich and allows 
overlap relationships among clones to be deduced. 
However, it should not be assumed that this clone 
will be sequenced to completion. In the event that 
the record is updated, the accession number will 
be preserved. 

1 685: contig of 685 bp in length 

686 785: gap of 100 bp 

786 1513: contig of 728 bp in length 

1514 1613: gap of 100 bp 

1614 2354: contig of 741 bp in length 

2355 2454 : gap of 100 bp 

2455 3184: contig of 730 bp in length 

3185 3284 : gap of 100 bp 

3285 3959: contig of 675 bp in length 

3960 4059: gap of 100 bp 

4060 4746: contig of 687 bp in length 

4747 4846: gap of 100 bp 

4847 5564: contig of 718 bp in length 

5565 5664: gap of 100 bp 

5665 6360: contig of 696 bp in length 

6361 6460: gap of 100 bp 



6461 7132: contig 

7133 7232: gap of 
7233 7937: contig 

7938 8037: gap of 
8038 8749: contig 

8750 8849: gap of 
8850 9522: contig 

9523 9622: gap of 
9623 10356: contig 
10357 10456: gap of 
10457 11181: contig 
11182 11281: gap of 
11282 11975: contig 
11976 12075: gap of 
12076 12785: contig 
12786 12885: gap of 
12886 13557: contig 
13558 13657: gap of 
13658 14347: contig 
14348 14447: gap of 
14448 15132: contig 
15133 15232: gap of 
15233 15912: contig 
15913 16012: gap of 
16013 16731: contig 
16732 16831: gap of 
16832 17506: contig 
17507 17606: gap of 
17607 18339: contig 
18340 18439: gap of 
18440 19109: contig 
19110 19209: gap of 
19210 19900: contig 
19901 20000: gap of 
20001 20693: contig 
20694 20793: gap of 
20794 21464: contig 
21465 21564: gap of 
21565 22245: contig 
22246 22345: gap of 
22346 23049: contig 
23050 23149: gap of 
23150 23857: contig 
23858 23957: gap of 
23958 24685: contig 
24686 24785: gap of 
24786 25499: contig 
25500 25599: gap of 
25600 26302: contig 
26303 26402: gap of 
26403 27108: contig 
27109 27208: gap of 
27209 27910: contig 
27911 28010: gap of 
28011 28720: contig 
28721 28820: gap of 
28821 29484: contig 
29485 29584: gap of 
29585 30328: contig 
30329 30428: gap of 
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* 30429 31111: contig of 683 bp in length 

* 31112 31211: gap of 100 bp 

* 31212 31931: contig of 720 bp in length 

* 31932 32031: gap of 100 bp 

* 32032 32710: contig of 679 bp in length 

* 32711 32810: gap of 100 bp 

* 32811 33502: contig of 692 bp in length 

* 33503 33602: gap of 100 bp 

* 33603 34301: contig of 699 bp in length 

* 34302 34401: gap of 100 bp 

* 34402 35100: contig of 699 bp in length 

* 35101 35200: gap of 100 bp 

* 35201 35914: contig of 714 bp in length 

* 35915 36014: gap of 100 bp 

* 36015 36709: contig of 695 bp in length 

* 36710 36809: gap of 100 bp 

* 36810 37518: contig of 709 bp in length 

* 37519 37618: gap of 100 bp 

* 37619 38330: contig of 712 bp in length 

* 38331 38430: gap of 100 bp 

* 38431 39109: contig of 679 bp in length 

* 39110 39209: gap of 100 bp 

* 39210 39912: contig of 703 bp in length 

* 39913 40012: gap of 100 bp 

* 40013 40712: contig of 700 bp in length 

* 40713 40812: gap of 100 bp 

* 40813 41514: contig of 702 bp in length 

* 41515 41614: gap of 100 bp 

* 41615 42294: contig of 680 bp in length 

* 42295 42394: gap of 100 bp 

* 42395 43094: contig of 700 bp in length 

* 43095 43194: gap of 100 bp 

43195 43915: contig of 721 bp in length 

* 43916 44015: gap of 100 bp 

^ 44016 44740: contig of 725 bp in length 

^ 44741 44840: gap of 100 bp 

* 44841 45533: contig of 693 bp in length 

* 45534 45633: gap of 100 bp 

* 45634 46330: contig of 697 bp in length 

* 46331 46430: gap of 100 bp 

* 46431 47137: contig of 707 bp in length 

* 47138 47237: gap of 100 bp 

* 47238 47927: contig of 690 bp in length 

* 47928 48027: gap of 100 bp 

* 48028 48709: contig of 682 bp in length 

* 48710 48809: gap of 100 bp 

* 48810 49509: contig of 700 bp in length 

* 49510 49609: gap of 100 bp 

* 49610 50331: contig of 722 bp in length 

* 50332 50431: gap of 100 bp 

* 50432 51135: contig of 704 bp in length 

* 51136 51235: gap of 100 bp 

* 51236 51981: contig of 746 bp in length 

* 51982 52081: gap of 100 bp 

* 52082 52783: contig of 702 bp in length 

* 52784 52883: gap of 100 bp 

* 52884 53580: contig of 697 bp in length 

* 53581 53680: gap of 100 bp 

* 53681 54398: contig of 718 bp in length. 
REFERENCE: 1 (bases 1 to 54398) 
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FEATURES (FEAT) 
Feature Key 



Location 
,54398 



Qualifier 



source 



1. 



/organism="Homo sapiens" 
/db-xref="taxon: 9606" 
/chromosome="3" 
/map-" 3" 

/clone="RPll-660H19" 
/clone-lib="RPCI-ll Human Male 
BAG" 



SEQUENCE (SEQ) : 

1 ggacccatgg 
61 atatgccacc 
121 catccccatt 
181 cccccactca 
241 taattcagat 
301 ggatgtcagg 
361 gagttaggaa 
421 gaggaaaatt 
481 cttccaaaat 
541 tgagcaaggc 
601 acagccaggg 



gcgtatgttc 
tcccaacagc 
tggggacggg 
cgcctgtaat 
gtcatcattt 
ggtgtggttc 
aaatctgaga 
ttagggatca 
gcctttacca 
tgctgccaag 
ggacttgtct 



tannnntgtt 
aaaacaaggc 
agactatgtt 
cccagcactt 
tccagacagg 
ccgtgggcat 
ttatgaaaag 
aaattttagg 
gccacagcag 
ccccagggat 
atgttccaag 



ngagttagng 
ggctccactg 
cgtcgacaga 
tggtattata 
tttgagggcc 
tttgtgctgc 
cagcaacatt 
gacaggtgaa 
tggccagtgc 
gaaccgctgg 
ctccctcaag 



cttccccctt 
ccaggatttc 
attttcttaa 
taaatgattt 
actgttgaaa 
aaaggaaaga 
tgggggttgt 
gctggcgggc 
tcagaaaagg 
cgaaggtgaa 
ggctccaggg 



cttccttaaa 
atcatccctc 
tttaaaattg 
ataaatgata 
gggtggtgat 
aggagaggaa 
ttgaagaagt 
gtaataagga 
caaccaacgc 
atggcctcga 
ggactcctca 



661 gccggtcctc atccccctgc atcccnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

781 nnnnngcaac tggttgcnag aatcccctac tttgtgtnan agaataccct ttcattatgt 

841 gggatagata ctttctaaac acatggtgac tatggttaat aatactgtta ttatatatac 

901 tgaaatttgc taagagagca gagatcaagt gttgtcacca cacacacaca cgagtaacta 

961 ggtgaggtgt ggatatgtta gcttgattgt ggcaaccatt tcacaatata tatgtatatc 

1021 aaaatatcac attgtacact tcaaatgtat acacttttta tttgtcaagt atacctcaat 

1081 aaagctggag aaaaaaataa taataaagag taagaccctt ggaaatgcct tgaggtactt 

1141 ggtgacagtc aatcttctga cctcatctat atccacaacc caggttatat ttctgtttga 

1201 gagaataagg atgtattttt gtttactgga ttatagtgcc agcattagtc ttaactctgt 

1261 ctccttctct gtctctctct ctctctcttt ctctctctct ctctgtatct agctctgctt 

1321 ccctctcttc ccatctccct ccatccatct agtaggaata ttgtgctggc cttggtttcc 

1381 atgctctctc tcccctatgg ctccctcact ctgcctcggc ctgagcgagc atgaatcact 

1441 tctgctgggg tgtgctctgc tgtactgctt ttgttctcag agctgngatg taaatacgac 

1501 tacttcagtg tttnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

1561 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnngtggttg 

1621 gggggaacaa actattgctg acaccccaac acttctgtna gnagaaaacc cccataaact 

1681 ttaattgtag gaggggcctc taacattgga caaactttaa agatgtattt tcatgaaagg 

1741 tacttcttta atggaatttg agataattag agcatttatg atgacattta attttgtgaa 

1801 aagtatagtg agcattttta tgtaattttt catttagagt tcatgaaaga ttttctgtgg 

1861 aattggagca tcagggaaat tatggcaaag aatgggcagc cttattctga atatctactg 

1921 catggtttta aagctgtgat tgtgttttat tttagagcaa tgataatgag gagcgcctac 

1981 aagttgttaa actactggca aaaatgtttg gggcaaagga ttcagaattg gcttctcaaa 

2041 acaagccact ttggcagtgc tacttgggca ggtatatgat tttggtttcc catttaaaag 

2101 taatacatta ttctactgac cgttttttta agaaacatta tccacattta aactgtgttc 

2161 tctggaatca taaaaatgaa ggatttttta tgttgaaagt tttgaaagtt gttaatgact 

2221 ggctatattt atatttactc tttttagtct ttatcctcag taatttttaa aagataatac 

2281 attttaaagc tatctcttat ggtattatga ccatgaagtt aatattatta ctttaaggca 

2341 gtgattaatt ccaannnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

24 01 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnngttctg 

24 61 gnngnggtaa catggttaaa agctgtacct cgttanagtt aacactncta cttagaggca 

2521 ttattcattc tttttttccc tcctctcaac cgtcaccaaa ttctgttaat actttctcca 

2581 aatgtcttga acatttatct tctactctcc taccttgcta tattttcact tggattgttg 

2641 tatacattgc ctccctgttg taattgttcc tgtgcctctg acccactgcc tgctgtatca 

2701 cattgctatc aaaattatcc ttctaaaaca gattagatca catcagtccc ttacttgaaa 

2761 tttctgttga ttctctgttg cttgtgggat aaagttgaaa caccttatag ccctttatag 

2821 tttgctctca gcctgagttt caagctttct ttctgctccc tattatgaaa tctgtgtttc 

2881 atttactttc gattacttga aatttgtgta acagtctttt ttctctgttg tcaactctat 

2941 ctggaaattg cctcctgttt ccagacaaac aaacaatgaa caaaaccaag gaacatggtt 

3001 gacacctttt gaatgcttta aaatacagtt ttggggttac cacatcactg tgaccctctt 

3061 tctttttttt tccttcctta cgctggaaaa tgtgaggggt ctgcagtcac atcccacttc 

3121 tgccgtttac ctgtcgtatt ttggcagatt acttaatcat ttgtgcccta gtttccttat 

3181 ttgcnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

3241 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnacataa ctggtgcaaa 

3301 aaccgacaac tcgttagaga aaactgctga ttttgattga gatcatggga atttgcctag 

3361 gaaaagtgaa agttttcacc ccttagatga aaacaacact gctccattga cctctgtcct 

3421 atgaagtgta tgtgtggaag ggagattgag aaaaagcact ctaaactgaa aaagaaaaaa 

3481 gaatactctt gttttccctg aaggcaattt aaaatgacca ggcgcactgc tttttagatg 

3541 aattatgcct tttacatgtc tacatacagc atttcaatga aaaaaaaaaa accctagaat 

3601 cattctaaat gattgataaa agataaaata tgatacatcc aatggatgaa taccatcatt 

3661 acattataaa tactaggtag tacattaaaa tcttcataaa tattaagtaa aaacatcaag 

3721 atacctaact ttaattacat aattattgaa actttctaaa accacaagca tatacaaaaa 

3781 agtagggaac atcttagagt ggcaacataa agtgcaattt ctttcgattt tctatgcttt 

3841 caataatgct gagttattgt tacaccttta ttttaaagct accaaaaaac tttgagtaca 

3901 cagacccaca ctctctgtaa tgttgggtat taccttttgc agatataaac atttatcacn 

3961 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

4021 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnna atactatttn gcaaaccccc 

4081 cacacttgct tnanagaatc ccctatatga gatggtgttt aagccttaat ttttgtatct 

4141 tatttaagtt agggcagatt tatatttact agtttttttc tgttttaaaa gtatgtttgt 

4201 tttctaacac atctaactgg tacttattag ggcccagatg aatattgcag cttcaaaatg 



4261 aactttagtc acaacatctc actcttatga gcagtgaagc aattattgta gataaacttg 

4321 ctactgttta ctctttactt aaaatagcaa attttatatc tgaagttgtt actctttgct 

4381 gccatttgaa taatatctgg aataattgtg tctttacagg atgaatgcta tcaagtaaga 

4441 caagtgtttg cccagaaact tcacaaaggc ctttcccgtt tacggcttcc acttgagtat 

4501 atggcaatct gtgccctttg tgcaaaagat cctgtaaagg agagaagagc tcatgctagg 

4561 caatgtttgg tgaaaaatat aaatgtaagg cgggagtatc tgaagcagca tgcagctgtt 

4 621 agtggtaagc atataagaaa atggaaagga tactttttca gcctgctagt ttcagtttta 

4 681 tagaatatag catgatatat gatattactg ttatttatta tttgtgatta acaattttta 

4741 ttaccannnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

4 801 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnngaa tgaggtgctg 

4861 agtctgacga tcggctctna nagaaaacac tttcatcagc aggcaagcaa tctcagacca 

4 921 aatcatcacg aatggaaact gtaagcaatg caagcagcag ctcaaatcca agctctcctg 

4981 gaagaataaa ggggaggtaa gtgcaaaaga aatgccacaa tttacattta aggattccag 

5041 caaggctatg agatctgaaa taatataatc caaaatgttt catgtcagta aatggaaagt 

5101 aatgtattac ttaatttttc catttaacat tggtagtcac aagttcatac ttatttgctt 

5161 gctcattttg gatgggtgct ctgtcttagt ctttcattct ttagcttgct gggggtgatg 

5221 tgggggcgtg ggcgtctccg cgcggcggtg cctggccgtg ccccgctccc cgtgtcctgc 

5281 ccgtcttcgt ctttgcgcgt tcccttccct cccgcttttt tgcccttgtt cccctctccc 

5341 cttttcttcg tttcctttct cccccccccc cgttcctccc ctttcccccg tttttctttc 

5401 cgcgtccttt ttccctcgcc ctttgtgttt ttgccctgtt tccctccctt tgccctttcc 

54 61 ccttccccct tccccccctt ccctcccccc cgttctcctt ttccttgtgc cccccccctt 

5521 ccctcgtgtt cgtcccccgg ttggcccccc cccgtttttc ccgcnnnnnn nnnnnnnnnn 

5581 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

5641 nnnnnnnnnn nnnnnnnnnn nnnnaacact tcgttgctga cctccaaccc tgactctana 

5701 gnatccccca tacttcttca cttacacagt cttctttggc catgtcatca tctatgtaag 

5761 tgactattac atcagcaacg ttaaggtttt tcaagactcg ggtcctactt caagtgctaa 

5821 acagaaattt tatactttta tgttctgtga atataatact gagagttata atatttccca 

5881 gatctattct ttcttgacat gtagagccta tgtaagtgtt aaatatagta gttcacagat 

5941 ctattctttc ttgacatgaa gagcctatat aagtgttaaa taaatctgaa tggcatttct 

6001 tatacctaaa agcatggttt atatttgact cttggcttgt ccttgcacct tgttgcaaag 

6061 tacagataat tctgtccttt gatatccatt atataatttt tattcttact ttcttgacca 

6121 ttcttattgc tgtaatcatc attagctctc accaagactg ttgggaatct tttttattgt 

6181 tggtaaattt ccctaattac agttatgatc atgtcattcc tctgttaaaa tatttagggg 

6241 gttgatgctt gtaaaaacaa ctgcacgtat tttacaacct ggtgtttgag cccctctaat 

6301 attttaaccc aaatgtacgc ttaaaaatct tgttgagcac actctccttt acttaacctn 

6361 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

6421 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn gactactttt gcagacaccc 

6481 acccctgctn ctnanagaaa cccccttcag tttttgtttt ttttctgacc tatctttgag 

6541 ttcactcgct tcagttttgt ctagtcgtac ctaaagctca tgctttgaat tcttaatttc 

6601 acatatatac acatacttca attctgtaat tctatagttt ttaaacatag ctctttgttt 

6661 tgctgcagaa tttgcttgtc attaaatcct tgaaaaatct taattatgtg aagtttgtat 

6721 ctgttaggct tatcatcttg atctcttgta acttttgtct attctcaaaa ttttggtcaa 

6781 atcttacctt gtacatatat tgcttttgat tgagtgatag actttttgta tgaaaagttg 

6841 ttgagataat gtgaggcttc agatgtggtt atattcttcc agagttttac ttttgtttct 

6901 atcaggcagt taagctatcg tcagatcact tccatccaat gaaaatggaa gtttggcttc 

6961 agttttggac aagcttatta tttttttctg gtcaccttat gtctagagta tatttgtctg 

7021 ggatccaccc agaagtccag gttgtttaac aaggccttta cttaatgggc tcaggattct 

7081 taatcttgtt cttctagctc tgtgacagga aagctttgct caacttctta tnnnnnnnnn 

7141 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

7201 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnacaacttc ttgctgacac tgccatactt 

7261 gacgctgana gnatccccaa ctaaacanat anaaccacat taaactaaaa agtcacaaag 

7321 ggtacataaa gcacagcaaa ggaaacaatc aacaaaatga aaagcagcct ataggatggg 

7381 aaaaatattt gcaaaccatg tttccaacaa ggaattaatg tcttaaggta cacgaggact 

7441 catacaactc agtagtaaca aaagccaaat cacccaataa aaaatgtgca aaggacctga 

7501 gtagatattt ttccaaagaa gacatataaa taggaaagag gtatatgtaa agatgtcact 

7561 aatcatcagt gaaatagaaa tcaaaaccac aatgatatgt cacttcatac ctgtaggatg 

7621 gctattacaa aaacattaaa agttaagtgt cagtgaggat gtggaggaaa gggatccctt 

7681 atacattttt ggtaggaacg taaattggtg cagccattat gaaaaacagc gtggagatat 

7741 cgcagaagct taaaaataga actaccctgt gatccaacaa tcccacgtct gagtatatat 

7801 ccaaaggaaa tgaaattact atcttgaaga aatatctgac ccttcatgtt catctcagca 



7861 
7921 
7981 
8041 
8101 
8161 
8221 
8281 
8341 
8401 
8461 
8521 
8581 
8641 
8701 
8761 
8821 
8881 
8941 
9001 
9061 
9121 
9181 
9241 
9301 
9361 
9421 
9481 
9541 
9601 
9661 
9721 
9781 
9841 
9901 
9961 
10021 
10081 
10141 
10201 
10261 
10321 
10381 
10441 
10501 
10561 
10621 
10681 
10741 
10801 
10861 
10921 
10981 
11041 
11101 
11161 
11221 
11281 
11341 
11401 



ttattcacag 
aagaaaatgt 
nnnnnnnnnn 
gttggnncgc 
ggccacaaca 
aaagactatt 
catacctgaa 
cttgtgcttt 
cagtacttca 
gaatagacct 
tgccttcttt 
tgtgtattta 
aataaatttt 
ctgtatgtat 
acatgacata 
nnnnnnnnnn 
nnnnnnnnnn 
nagnaacact 
tacaacggct 
tcaatctaca 
ggccgtgtct 
acgcaagaat 
actgactatt 
tcaaaatctg 
ggagacggaa 
ggcgacaacc 
tggatttcca 
taacaaaggg 
nnnnnnnnnn 
nnnnnnnnnn 
agnagnaaaa 
gctctttctt 
agaaacatca 
catacttatt 
cctcacttaa 
ctctgtgtca 
tgcaggcttg 
caaccagatt 
ctagaatgcc 
caatccttta 
caaggaagga 
aaaagaatca 
nnnnnnnnnn 
nnnnnnnnnn 
ctcaattagn 
tctcacagca 
tccaatagaa 
caatagatct 
gaacccaaag 
aaacaaggac 
aaatctgtac 
atttacttta 
acttatagat 
ttacacatca 
gattttaaaa 
aaaaaaaatc 
nnnnnnnnnn 
nacactgttg 
tctagattac 
gtattatttg 



aggccaagat 
agtgtgannn 
nnnnnnnnnn 
ataacttgtt 
tttttatcat 
tattatatat 
caaagcttat 
ggaacgccag 
tcatgaaata 
caaaaaggac 
gacattagct 
ctttgcaaat 
aacgagtatg 
tcagttcctg 
gaactaattt 
nnnnnnnnnn 
nnnnnnnnnn 
tgtgtgggac 
atcacgttgt 
tgttttatca 
ttaaagacag 
ggtccggttc 
ccggtaaaca 
atgacacact 
aaacatatca 
atacgctgag 
agcccacccg 
ttcttcgggg 
nnnnnnnnnn 
nnnnnnnnnn 
cccctaactt 
atctcctttc 
gccttcttga 
cttccctctg 
ttgaagtcca 
aatagcattc 
ctgtgctagt 
gagtctaact 
taaaagaaaa 
gtacttttaa 
aaaaatgtga 
gatcgacaaa 
nnnnnnnnnn 
nnnnnnacac 
atatgtgcag 
cataagacat 
atagaaattt 
atgcacatta 
tttagttgtc 
tttaaaatgt 
cttctgcagt 
aaaaaaaaaa 
ccatcacgac 
ctcccttatt 
aaacatggag 
ttgagctgga 
nnnnnnnnnn 
ctgaacccca 
ttataatacc 
tatatatttt 



ttggaaacaa 
nnnnnnnnnn 
nnnnnnnnnn 
gcagaaactn 
ccagttaata 
agttgtctca 
ctagcacaca 
acagcagttc 
acaaaaacat 
actgtttacg 
gggaatgtgc 
gacccaaaaa 
cagattccta 
aaaggaaaac 
caggttagaa 
nnnnnnnnnn 
nnnnnnnnng 
agctggccgc 
gtttgctctt 
aaaggtcggc 
cgataagttc 
tgcaaccttt 
ttacggcaaa 
caaaatcaac 
gaacgttcag 
agaecctcac 
ggaaccgaaa 
ggggggccca 
nnnnnnnnnn 
nnggactaag 
ttgnaacaaa 
tatcctaccc 
aatccttgat 
cctggaatac 
tgctcagatg 
ctgctgggga 
ctgtcatata 
aaagttgcaa 
aagggcaaat 
ttcaatgtcc 
ccaggaaaag 
tgattgagaa 
nnnnnnnnnn 
ttttgctgac 
ctgtccacaa 
catacaataa 
taatacctca 
ctcactttga 
tttttccctg 
aaatatgaat 
gacatttgtt 
aaggttgaaa 
tggttctttc 
tagccccttt 
ccaaaaggtg 
gnnnnnnnnn 
nnnnnnnnnn 
gtttgtctct 
taaatgctat 
ctccaaatat 



cataaatatt 
nnnnnnnnnn 
nnnnnnnnnn 
acaacgtgtc 
tacagccttg 
actgaactca 
cattttctct 
atcgctatgc 
caaccacaaa 
tatgaaagct 
acatctggct 
ttgcttcaag 
aatatagaat 
aacagtatcg 
gcattcctta 
nnnnnnnnnn 
gacccttgtg 
ttcaaaacgc 
gcgggaagcc 
gacaactcaa 
gacgccaacg 
acatctgacg 
caaagcctga 
ggagtggaag 
cagtttatcg 
tacgtttgaa 
aacggtccca 
aacttttttc 
nnnnnnnnnn 
ctgtgctaga 
attaccaana 
cttccttcag 
gcacaaagtt 
tcttctccca 
ttaccttctc 
gctgggcgga 
atgcatgtct 
cccaggctca 
ctgctcttgg 
tcaaataaga 
tttgatggat 
gttgtannnn 
nnnnnnnnnn 
actgctaaac 
aatcccagct 
atgtaaatac 
aatattacaa 
ccattgtctc 
gaagctaaaa 
tcttttcaat 
atatatgact 
ggcatacctt 
taggtctgtg 
tgagattcaa 
tagctcttaa 
nnnnnnnnnn 
nnnnnnnnnn 
anagnacccc 
gtagttacta 
ttttgattcg 



agtcagtgga 
nnnnnnnnnn 
nnnnnnnnnn 
tanagaaaac 
ttttatgtat 
cagccaacag 
gtaaggaata 
ttgtgctttg 
actgtgaaaa 
gaaacaagaa 
acttacattt 
tattaattnt 
ccacaaataa 
aaatttatac 
ttactcagnn 
nnnnnnnnnn 
ctngaggctc 
tgacggaaca 
cgaaagacgc 
tcgacagctg 
atccgatcct 
gaaaaatccg 
caacagcgca 
atcacaaaac 
atgaaggcaa 
gacaaagcca 
aggggaaaga 
cannnnnnnn 
nnnnnnnnnn 
aacccccagt 
gacatcatag 
cttttgctca 
cattcttgct 
gatattcata 
aaagacgccg 
atagcctgaa 
taccattaag 
actcttgaag 
gagaaaaatc 
aaaaagttac 
gttcaataga 
nnnnnnnnnn 
nnnnnnnnnn 
ttctgtnana 
ttctcattat 
ttgagtgact 
tgtggtgatt 
aggagatact 
ataaacgaga 
tagcacttaa 
aaagtacttc 
cactagcgcc 
gggggactaa 
ttatttaatg 
acaagaagaa 
nnnnnnnnnn 
nnnnnnnnnn 
cccctacact 
tacagcattg 
agttagttaa 



tgaatggata 
nnnnnnnnnn 
nnnnnnngtt 
ctagacccct 
gtttctgttt 
cactgtaact 
tcagagcctt 
aaacaccaga 
atgtggcact 
ggcacagtgt 
tttgctgctt 
tggtgttaca 
ggaagactga 
ttttataaag 
nnnnnnnnnn 
nnnnnnnnnn 
gactggtcta 
gtagctgcaa 
tgatgacaca 
gaaaaacgcg 
gaaagatcag 
tttattctac 
ggtaaatgtg 
gatttttgac 
ttatacatcc 
taaattccct 
atcttttttt 
nnnnnnnnnn 
nnnnnnnnnn 
ccttactgtn 
gtgtgatctg 
attcaactcc 
taaggacttt 
aggcttgatg 
cctttgacta 
tgaaatctcc 
acatttgaaa 
tcatcaatct 
acatcaactt 
aagatccaca 
aaaaagagtt 
nnnnnnnnnn 
nnnnnnnnnn 
gnatccaccc 
taccctatag 
ttaatatctt 
atattaaaat 
tcttcatctg 
aaaaaaaaaa 
aaaatgaatg 
atctaaccct 
tatgtacagc 
ctctaacttc 
caagaactag 
aggtataaaa 
nnnnnnnnnn 
nnnnnnnnnn 
ttaaatcatc 
ctttttattt 
atccccagat 



114 61 gcagaaactg tagatctgga aggctgatgg 
11521 cattaaatat gtacaggttt ttatatctca 
11581 aaaaaaaaaa gaaaggtttt gcataaaatt 
11641 agttttgccc aacattcctt accacattgt 
11701 atgtaatatt tgcttccaaa taccaagaaa 
11761 aataatttga tcattagttt tttaaaggcc 
11821 ccctgtaaga ttttagggca ggaagggatg 
11881 caggtggtga tgagcctttg agctgggccc 
11941 gacgaagctg taatagaaag cgtcagaggc 
12001 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
12061 nnnnnnnnnn nnnnnaacta acgtgctgaa 
12121 gcttctttgg gctataacca tgacttactc 
12181 tacagcacac agaagtacag agtaaatctt 
12241 ctaagtaatt attaaattat gttaaaattc 
12301 ctttctgttc aaatacataa cagcagcctc 
12361 aagaaataaa gtacatgtac taaagcgcaa 
12421 agataaaata aaagactcag agcaagtttt 
12481 gcagaaaggg ttaggttgga tgtgtggaaa 
12541 aaagtacaag gcacaaaaaa gaaaaaaagc 
12601 aggagagtgt gctcaacaag atttttaagc 
12661 caaacaccag . gttgtaaaat acgtgcagtt 
12721 ctaacagatg aatgacatga tcatcactgc 
127 81 gatcannnnn nnnnnnnnnn nnnnnnnnnn 
12841 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
12901 gaccccgctg tcttggctnc tagnagaatc 
12961 gtattctaat aagtagccca acanagttca 
13021 aaatctaagg atgtctccag acttcccgtg 
13081 ttaaacagcc agctataatt gtttgaatgg 
13141 tgatctgaca atctcatttc aaactgacct 
13201 cctgttgaag acagtagtac atccttacat 
13261 aaaatgcttc aaaactttca tatttagatt 
13321 ggtttgacaa ccattagcta tgcagatatt 
13381 gtttgaatgg aatgataccc ttaagctttt 
13441 ctctcttggg gtctggcatc acattatgtt 
13501 atcaaatacc actttcaatt ttcaaataag 
13561 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
13621 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
13681 tgactnncta nagnatcccc tgtcatttta 
13741 aaattttgtg tttcaaatag aaaagttgag 
13801 catttatgtc ttggctaatt ttatcccttt 
13861 tcagtgaaat tgtctttttc ttattgatta 
13921 cccagttgaa tcacctttta aatctctttg 
13981 atgtcagtca tttcctttgt ggtttatttc 
14041 tttttacatt tacatctgta attagtctgg 
14101 gtcttttttt ttttttgagt aagtctcact 
14161 atctcagctc actgcagcct ccacctgctg 
14221 caagtaactg ggattacagg catgtgccag 
14281 gaaatggggt ttcaccgtgt tgcccaggct 
14341 cccgcctnnn nnnnnnnnnn nnnnnnnnnn 
14 4 01 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
14461 accccatttt ggctctgagg aaccccattt 
14521 aataccatcg atcgtcattt ttgaaccgcg 
14581 cattttgaaa acattcgcgc gctcgatttc 
14641 cttcattact tttttcaatg tgtaatcatt 
14701 cgctaactca gcatcgcgtt ttttagcgct 
14761 gaagttcgtg ccgccgccgt agtacgcttt 
14821 gttttctgtt cccgtgttgg cttcgaatac 
14881 gtgagggtct ctcagcgtat ggttgtcgcc 
14941 ctgaacgttc tgatatgttt ttccgtctcc 
15001 gttgattttg agtgtgtcat cagattttga 



tactcatctc caaactcatc agattgtata 
atcataactc aaaaagtggt ttttttttaa 
gacacagata ttccatctga attggagttg 
ccatgaaaaa tcaactgttt gagaatataa 
gataaggata agaatttatt aagaaagata 
tctaaatgcc atgaggaagc acaagcaaat 
atttcaggat gagatcatca ggggagactc 
tgaagaagag gacggatttc cataggcaga 
agagnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
cccccgctcg acnctnanag natcccccta 
atctttgtat ttccagaact tagaacaatg 
tcatgaactg aattccaagt tctaaaatat 
atattgtttt attaaatttc tccaaatgta 
ccagatacaa tactgcattt aaagagcctc 
agcatagtcc tgtacgtatc agaggactgc 
gtaaagaagg ttgtgtctga gcactgactt 
tgggagagta aagagaaaat taaaggcagc 
tgttcagcta catgaaatga ggttaagtaa 
gtacatttgg gttaaaatat tagaggggct 
gtttttacaa gcatcaaccc cctacatatt 
tgttagggag atttaccacc aactataaaa 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnntacaa catacttgca 
cccttntatc atgttgatac aaacctaata 
tgtaattaac caagctgttg ctttctgcta 
gtaaaaagtt taccttacaa aatgcaagtt 
tgagattgcc atttcagagt ttcaatgtat 
aagaaaactg aactctgaaa ggccatgttg 
tataggacaa caaattgcaa cgatctctaa 
atattgtctg acataagtga ctattgttaa 
tgcattattg aattaaattg gatctgcata 
gccttcggca tcaattaatg tgtgctcagt 
ctctttatgt gtaaatatat ttaggagcaa 
cttactcggt ttataattaa atttctannn 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnata cttcttgctg actcgcattt 
taggtgaatg aagagagtac cctttaaaaa 
caattattaa taaggttgaa cttttttttt 
ggagaatgct ttttattttt ccatagggac 
taagaatgct ttataaaaca aggctgttca 
caattaataa atattacatt gtataattaa 
agcctttggt atcatattaa tagctctgct 
aatttagtgt aaggtggaca tagagactta 
ctgttgccca ggctagagtg cagtggcttg 
tgatcaagca attctcctgc ctcagcctta 
cacatccagc taatttttgt atttttagta 
tgtcttgaac tcctggcctc aagtgatctg 
nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
nnnnnnnnnn nnnnnnnnta catattgctg 
gatctaacca actgtaaatt cgtttgagtt 
tgaatcagtg aacaagtacc atttgccgtt 
atcagttacc gtgtttgaag tgatcagcgg 
atttaactct atgataccga gggcgccgtt 
ctgctgaagc ttctggcttt ctttacggaa 
gttaaataaa gattcttcgc cttggtatcc 
aaggtattta tggcctttgt cttcaacgta 
ggatgtataa ttgccttcat cgataaactg 
gtcaaaaatc gttttgtgat cttccactcc 
cacatttacc tgcgctgttg tcaggctttg 



15061 
15121 
15181 
15241 
15301 
15361 
15421 
15481 
15541 
15601 
15661 
15721 
15781 
15841 
15901 
15961 
16021 
16081 
16141 
16201 
16261 
16321 
16381 
16441 
16501 
16561 
16621 
16681 
16741 
16801 
16861 
16921 
16981 
17041 
17101 
17161 
17221 
17281 
17341 
17401 
17461 
17521 
17581 
17641 
17701 
17761 
17821 
17881 
17941 
18001 
18061 
18121 
18181 
18241 
18301 
18361 
18421 
18481 
18541 
18601 



tttgccgtaa 
aaaggttgca 
nnnnnnnnnn 
gttgcagact 
ttcattaaaa 
ttttcatgtg 
tgcaataact 
cgtaactggg 
aatatttgta 
gtggggaata 
ataagctcga 
atcaactaat 
atttttagaa 
taatgatgaa 
ttttaaaaaa 
nnnnnnnnnn 
gctagacccg 
tcaccaatag 
acaggcctct 
acatgtatga 
ttcattttgt 
caatattaca 
tttagggaca 
agaaaaaaat 
attacaattt 
aggtactgag 
actcaaacat 
caaagcaaaa 
nnnnnnnnnn 
nnnnnnnnnn 
agaacccttt 
ctcccactct 
gattttaaca 
ctggcctcat 
ttcttttttt 
ggctcaatct 
gccccctgag 
ttagtagaga 
gatccacctg 
ctctgttttt 
tcaatggtga 
nnnnnnnnnn 
nnnnnnnnnn 
gaganaaacc 
aattgcattg 
tgcttgttac 
tgcagttagg 
agatcgatgc 
tttggttgcg 
gaatacttat 
agtctccagg 
gctttcttta 
ggtgcgtgca 
aggctgacag 
gcttctcagg 
nnnnnnnnnn 
nnnnnnnnnn 
ttatgtgtac 
gcatttaaaa 
atgtcagtgt 



tgtttaccgg 
gnnnnnnnnn 
nnnnnnnnnn 
cgcagtccga 
tttccattcc 
aggtcaatta 
atcttcctct 
gaactactcc 
attctttcaa 
ggaatatagt 
gaccctggtc 
ttatgatgcc 
gtattttata 
ttctcatggc 
tannnnnnnn 
nnnnnnnnnn 
aacacttctc 
caaacactaa 
ctctttttat 
aactgacact 
gccagagcaa 
tgccagggac 
agataaatta 
ttttaaagat 
taaacagaag 
taagtcatat 
ttaaaacgaa 
gaaacttcca 
nnnnnnnnnn 
nnnnnnnnnn 
tgtgttgctg 
gtggtttgca 
tagctcatag 
aagtgagttg 
tttttttttt 
cggttcactg 
tagctgagac 
cagggtttag 
cctcggcctc 
taagagtttg 
ggccatctag 
nnnnnnnnnn 
nnnnnnnnnn 
ccctcctctt 
ccaataaacc 
ctattggcat 
ctgtgtatcc 
aatgattcaa 
agtttcctgc 
ttgccggatg 
agaaatgaca 
tgtgctgcat 
aggcatgtgg 
tggctgactc 
aatctgcagg 
nnnnnnnnnn 
nnnnnnnnnt 
ctggctcatc 
cactacttag 
tagttaaaat 



aatagtcagt 
nnnnnnnnnn 
nnnnnnnnnn 
ctctanagaa 
atttcttatc 
atcatcttgc 
gtagaaggag 
tacttattac 
agattattgc 
gatatttaga 
cttgttcagt 
tttagtttcc 
gttaaataat 
aaaggaaaaa 
nnnnnnnnnn 
nnnnnnnnnn 
tanagnaacc 
caggctggtg 
tcttactgct 
gtctcttctt 
accacaaaaa 
tattaaaagg 
acaaaataaa 
ggaggataga 
atctccctga 
agccacctgg 
taccgaataa 
tatcttcctc 
nnnnnnnnnn 
nnnnnnnnnn 
gacatagtct 
ttttcatctc 
gttttgatga 
ggaagtattc 
gagatggagt 
caacctccac 
tacaggtgtg 
ccacgttggc 
tcaaagtact 
ttaagaattg 
gcttgagctt 
nnnnnnnnnn 
nnnnnnggac 
tgctcaatat 
ggtccgtggg 
gctcgtagct 
ctttctcact 
tattgctgtc 
agggaagaat 
acaccggctg 
aaaagtccgt 
cacaggtgta 
ctgcaaaggc 
actgtgtgtc 
tcacccagca 
nnnnnnnnnn 
gcttgttgct 
tagctgttat 
cacataataa 
ttaacgtttg 



gtagaataaa 
nnnnnnnnnn 
nnnnnnnnnn 
ccccctggtg 
tttttcttct 
ttttactcat 
aatggtctaa 
ttaattggtg 
ttaaagactg 
gaaatcttta 
tacaataaaa 
tattatatta 
ttgagaagcc 
atgccctttt 
nnnnnnnnnn 
nnnnnnnnnn 
ccatacaaag 
actcacactc 
atcattccag 
tctcaagtct 
ggaaaatctg 
ctccaatatt 
tgttatgtta 
aataggaaac 
taaagaaaca 
aagaacattc 
aggagctctt 
attgcctgat 
nnnnnnnnnn 
nacacttttg 
ttgactggat 
ttaacgatat 
caggcctcat 
cctcatcttc 
cttattctat 
ctcccgggtt 
caccaccatg 
caggctggtc 
gggattacag 
gtttgagttc 
ttctttgtgt 
nnnnnnnnnn 
tacgtgncag 
gtttatattg 
gagaccacaa 
ctgtaaaaac 
taaaagacac 
acctaaggga 
cttgagtgat 
tctcaacgcc 
ttctcaccat 
agtaagactt 
caccttattc 
ctgcanggga 
tgcataaccn 
nnnnnnnnnn 
gaaccgatct 
gatataacta 
ttagaagcat 
tcctctgtgt 



cggatttttc 
nnnnnnnnnn 
nnnnnnnnnn 
ttccttttct 
tccttctgaa 
gtagtaagca 
ttttaacaat 
attctctaac 
aaaacactgt 
agaccagcat 
gattaccatt 
aaatgtttct 
tcaggagaat 
tttcccactg 
nnnnnnnnnn 
nnnnnnnnnn 
tggtgttttt 
tcctgttaac 
ctaaacttct 
ctgcctcttc 
attaaacaat 
taacttgatg 
aactgtgaga 
agggagggag 
tgtgagcaga 
atgcatcgta 
aaactattat 
accaacccag 
nnnnnnnnnn 
cagaacccaa 
atttatactg 
cttttagggg 
aagtaagtta 
taggtttttt 
tgcctgggct 
caagcgaccc 
cttggctaat 
tcgcactcct 
gcgtgagcca 
ttcaaatgtt 
atagnannnn 
nnnnnnnnnn 
aaaaccacga 
cgttggcaac 
cgccttctgt 
aaggcaaagc 
aaagccagcc 
gtaagaaaac 
gggggagtga 
catgcaatat 
gacactcgga 
gtggaaagag 
acagcattca 
ggtgctgcag 
nnnnnnnnnn 
nnnnnnnnnn 
gactctanag 
aataagttag 
tgatgataat 
aaaccaagtg 



cgtcagatgt 
nnnnnnnnnn 
nnagctagac 
attgaagtaa 
actgtgtaga 
catttttctt 
gggccaaatt 
tagagtttaa 
acatgcctgg 
aattctttaa 
gggattttta 
taatgtctta 
aaaatcaaat 
cagtttcctt 
nnnnnnnnnn 
nnacactttt 
tactttttat 
gccgctccca 
gccaccttac 
caattcacct 
gcttattaag 
ctactagcat 
actgctacag 
gtataagaga 
gttctgaggg 
ttatttcata 
acaaacatct 
annnnnnnnn 
nnnnnnnnnn 
cttctctang 
caaggaactt 
aaaatttctt 
cagggtaata 
tctttttctt 
ggagtgcagt 
tctctccaca 
tttgtatttt 
gacatcaagt 
ctgcactctt 
tggtataatt 
nnnnnnnnnn 
nnnnnnnnnn 
ctcactctan 
tgaattaaga 
ggacagtata 
tcttaagatg 
ataagaagac 
aacccttttg 
gaacacactg 
taactcttgg 
gactcaccca 
ggctgatgaa 
gacccagcag 
atggaactca 
nnnnnnnnnn 
nnnnnnnnnn 
nacccctata 
tatacctaaa 
ggtggttatt 
ccatttggtg 



18661 
18721 
18781 
18841 
18901 
18961 
19021 
19081 
19141 
19201 
19261 
19321 
19381 
19441 
19501 
19561 
19621 
19681 
19741 
19801 
19861 
19921 
19981 
20041 
20101 
20161 
20221 
20281 
20341 
20401 
20461 
20521 
20581 
20641 
20701 
20761 
20821 
20881 
20941 
21001 
21061 
21121 
21181 
21241 
21301 
21361 
21421 
21481 
21541 
21601 
21661 
21721 
21781 
21841 
21901 
21961 
22021 
22081 
22141 
22201 



cccctccttg 
tttaataaga 
ttttgcttat 
gttaacagaa 
attctgtatc 
tcccctagtt 
aaattcctac 
tcctcattgt 
nnnnnnnnnn 
nnnnnnnnna 
atttggtgct 
cttgcttctg 
agactcattc 
ccagtgagta 
catacttcat 
gcacaggaag 
agccctagca 
gtgtcgaaat 
aacctgatgc 
acttatacta 
aattacaaaa 
nnnnnnnnnn 
nnnnnnnnnn 
ctttaaatgt 
ttttgtattg 
agtttttcat 
catcagaagg 
gccacttagt 
ccagagatat 
tgcacataca 
tcacccttct 
acacacaaac 
tccaggctgg 
agtgattttc 
nnnnnnnnnn 
nnnnnnnnnn 
actctanaga 
ataaattttg 
gaaatatatt 
tggaacatct 
aatttgctgc 
atgatcgggt 
ttatccaaca 
tttaggtttt 
aaacatcttt 
aatataacct 
atttttaaaa 
nnnnnnnnnn 
nnnnnnnnnn 
ccatgttttc 
acagagactt 
gtaaacatgt 
tcatccctga 
tgcccacctc 
ttattttagt 
ctaatctgtc 
tccactgcta 
gccaccataa 
tggaggccgg 
ggcaggattt 



gtaatacaaa 
aaacatacaa 
actgtattct 
gcacatcctt 
tggttacttt 
tatatctgcc 
tgaactcgaa 
tactccatgc 
nnnnnnnnnn 
ctacttttgc 
ataatgaaga 
aaaatctagt 
tgattagtga 
gagcctgtgg 
tttactccca 
ctgcaagtag 
gtctctaaca 
gtttctttac 
cttctggaca 
tgtatattta 
taaattttgt 
nnnnnnnnnn 
nnnnnnnnnn 
tttttgacta 
gcctttcaag 
taaaatgtgg 
atgcacaaaa 
ttcctttccc 
tttattcata 
cacgcatatt 
gatttcaaaa 
atatattttg 
agtgtagtgg 
ctggcttagc 
nnnnnnnnnn 
nnnnnnnnnn 
accccaattt 
taatttgaaa 
ttcctcccct 
cataacacca 
tcctttgaaa 
aatttatatt 
aggaaagaag 
cctgtttaaa 
ttagttgaga 
gatgctatct 
atatgcagat 
nnnnnnnnnn 
nnnnnnnnnn 
accattttga 
ttaaacattt 
aacaaatatg 
ttcaatctct 
caaacccggt 
tattcatgcg 
aggaaatcct 
cttccatcca 
aaactactac 
aaattctgga 
attccaggtt 



ataggagaat 
agatgactgc 
ccttgtatgt 
ctcaaacaaa 
cttttattct 
aatgtttagg 
atcattnagt 
ggcccccccn 
nnnnnnnnnn 
tagaccctca 
tccccaaatt 
gggtgtttga 
cttctcttta 
agaagacata 
ctctcttgac 
agtcatcctt 
cattcatgta 
cctacctcta 
tagataacct 
ttgaaatttt 
aatttgaaaa 
nnnnnnnnnn 
acacttcttg 
aatgaagcct 
cctacaattc 
cattaaaatg 
agaatatacg 
agaggcagag 
tacaagtaag 
ttgtttccct 
attaaaacac 
ttttcttttt 
cgtgatctct 
ctcctgagta 
nnnnnnnnnn 
nnnnnnnnnn 
ttgtagtttt 
atgaaatatg 
cattttcagc 
ttggttacta 
tctttggtag 
ttttagattc 
tgaaatgata 
aaaaataagc 
ataggcatat 
atttcagatt 
tggtaaacca 
nnnnnnnnnn 
nnnnaatctt 
ggtggagcat 
aaagaatact 
tactgaattc 
aataacatat 
ccaacctcag 
caagccatga 
gttgactctt 
agccactatt 
aaactggata 
gaccaaaatt 
tctctctggc 



agctctgtgc 
cagctgcctt 
tttgcttata 
acatataaaa 
aatcaatcta 
actgacaaat 
attgatcagc 
nnnnnnnnnn 
nnnnnnnnnn 
acttctctan 
caagtgacta 
atgaccctct 
aatcccactg 
ttgctcatac 
cagaactcat 
atgcccagga 
tatggtcata 
aactgaattg 
atagaagtct 
tgtagttttt 
tgaaatatgc 
nnnnnnnnnn 
ctgaacctat 
catattgccc 
ttggtgttca 
tctaatagaa 
gtgaaaagtt 
ccattgttag 
ggcatgtaaa 
ctctctcctt 
acacacacac 
ttttttctga 
gctcactgca 
gctgggacta 
nnnnnnnnnn 
nnngtgcact 
tataatttga 
taaacagttt 
ctctgcataa 
ttggtcatat 
ctactttcat 
atgttctttg 
tcaataatac 
cttgtctcca 
gacctcaaat 
tacaggggtt 
gctcatagta 
nnnnnnnnnn 
cttgctaaac 
catacaactt 
atataaaact 
gtttgattct 
ccaaaactga 
ctttccccat 
agtcgattcc 
cttgcaaaat 
atatcttacc 
tcttaacaga 
aaggtgctgg 
ttgtagatag 



tccttaatat 
ctgcctttcc 
ctgcattctt 
aaaaacttgg 
ttctgttttc 
aataagtatg 
tcaccctggc 
nnnnnnnnnn 
nnnnnnnnnn 
gagaacccct 
gatgtagtaa 
aatcagtggc 
gatccttggc 
ctacctcagt 
tcatatatcc 
aattgaagtg 
aactatatgc 
ggcaaaactg 
aaatgtaatc 
ataatttgaa 
nnnnnnnnnn 
nnnnnnnnnn 
tttactctan 
taatttgttg 
gtgtgggaga 
aataaattca 
cccccaccca 
tagtttctaa 
tatgcacgtg 
atgccaatgg 
acacacacac 
gatagagtct 
acctccacct 
caggcgtatg 
nnnnnnnnnn 
tcttgctaga 
attacaaaat 
atttgtatat 
gagcctagat 
tgctctcctt 
tgtgaaagat 
taatgttaac 
ttccatttga 
ttataagcaa 
acattccttg 
taatttttga 
tgggnnnnnn 
nnnnnnnnnn 
gcattttgct 
gtcacctatg 
gccacagtgc 
cacaattaat 
attcctgggc 
ctccatctgt 
atttccctca 
atatccagga 
tggtttatca 
aatttattag 
tagcgttggt 
ctatannnnn 



atctaaagac 
atcttgtatg 
tgcatagaat 
taagctattc 
atgtctgtgt 
ttttcattta 
caagaaattt 
nnnnnnnnnn 
nnnnnnnnnn 
ggtatttaat 
aagcttattt 
ttagggacac 
atcaagccag 
gcatatgtgg 
tcacttaact 
gatattggtg 
tatgaagtct 
aaaatgtttg 
ttgagttact 
ttacaaaata 
nnnnnnnnnn 
nnnnnnnnnn 
gagaccccct 
ggaaagacag 
atagagcacc 
cttgtttcaa 
tgtatcccca 
acgtactttt 
cacacatgca 
catattttag 
acacacacac 
tgctctgtca 
ttcttgttca 
cccnnnnnnn 
nnnnnnnnnn 
tcgcagcttg 
aaattacaaa 
gtgtagagtt 
ccaagcaacc 
gcacctgatc 
cttctcatga 
ttttgcaatt 
ctgccttgat 
aaccatcttg 
ataaatataa 
caggacatct 
nnnnnnnnnn 
nnnnnnnnnn 
ctanagaacc 
cttctataaa 
tttgtacata 
tgtgtcataa 
tttccacccg 
agcaatttca 
cacatccttt 
tctaaccacc 
gtttgctagg 
ctcacaattc 
tccttctgag 
nnnnnnnnnn 



22261 
22321 
22381 
22441 
22501 
22561 
22621 
22681 
22741 
22801 
22861 
22921 
22981 
23041 
23101 
23161 
23221 
23281 
23341 
23401 
23461 
23521 
23581 
23641 
23701 
23761 
23821 
23881 
23941 
24001 
24061 
24121 
24181 
24241 
24301 
24361 
24421 
24481 
24541 
24601 
24661 
24721 
24781 
24841 
24901 
24961 
25021 
25081 
25141 
25201 
25261 
25321 
25381 
25441 
25501 
25561 
25621 
25681 
25741 
25801 



nnnnnnnnnn 
nnnnnnnnnn 
agnaaccccc 
tttgttttgt 
gatctgggct 
ctgggtaact 
aggcggggtt 
ctgccttggc 
gaattattct 
ggctgtttaa 
cttgttatct 
gttctcactt 
tgctttaagc 
atacccacan 
nnnnnnnnnn 
gactaggtct 
acttctcaaa 
tcattagaga 
gatcattaaa 
gtacactggt 
tagattattt 
ggggggcccc 
atttctaccg 
cccagccccc 
ttgccgcccc 
cccccgcccg 
cttccgcccc 
nnnnnnnnnn 
nnnnnnnnnn 
ctcctggtcc 
cttcacttga 
actcttctta 
aggcttctgt 
aggatacacc 
catctgatct 
tcaactgatt 
tctcactact 
ttcttccctc 
cctcaattct 
aaaccacaaa 
tccaaactcc 
nnnnnnnnnn 
nnnnnaggcc 
cctgancttt 
tttctgaata 
cttgaatact 
tgttgaatag 
cttccagctt 
ctgtttttct 
gaatatattt 
tgtgataagc 
ttaggcgccc 
agtagttgtg 
aattatggta 
nnnnnnnnnn 
nnnnnnnnnn 
gtttctactc 
gggtctttgt 
tttttttttt 
ggttagagga 



nnnnnnnnnn 
nnnnnnnnnn 
ttggcagnat 
tttttgagac 
cactgcaacc 
gggaccacag 
tcaccatgtt 
ctcccaaagt 
aaaattttat 
aattttcaca 
ggatttacag 
actgcagaaa 
agagagctca 
nnnnnnnnnn 
nnnnnnnnnn 
gagggtcgat 
agagacattt 
aagcaaatca 
aagtcaggaa 
ggggggggcg 
ggaccgggac 
cccctgttcc 
tgtgcggccg 
cgcgccgcgg 
ccctcggttt 
tgcctttcgc 
cgcgggcccc 
nnnnnnnnnn 
nnnnnnngac 
agctacgctg 
aattctaata 
tttccacttc 
atctcagtaa 
cttaaatcgt 
atctaacatc 
gctcatggtt 
gcatttcact 
ttcatccctg 
agctcctcta 
tgagattgct 
ttaccatggt 
nnnnnnnnnn 
tagcgttnca 
gttgaaattg 
tacaatcgtg 
ctttgtttct 
gagtggtgag 
ttgcccattc 
gtgtttttgt 
cttttaatga 
actgggttgg 
tgttaaaagt 
ggtttccata 
aaaatatata 
nnnnnnnnnn 
nnnnnnnnnn 
tanagnatcc 
tgtgatcttc 
tttttttttg 
caggtccttt 



nnnnnnnnnn 
nnnnnggcta 
gtttatttcc 
acagtctcac 
tccgcttccc 
gcatgtgcca 
ggccaggctg 
gctgggatta 
tgaattctaa 
tgagcaaaat 
aagaaaatcc 
ttcctttatt 
tgatgtattt 
nnnnnnnnnn 
nnnnnnnnnn 
ccgttgngag 
ctggccaaca 
aaaccacaat 
acacttttgc 
tggatcccgt 
ggactgacgg 
cggtgcccgt 
gtcccatatg 
tcttccccgg 
agcccctagt 
ccccgtgctc 
ggcgccgggg 
nnnnnnnnnn 
acatttgcta 
agttctaaac 
ggcacctgat 
cccaaaacac 
acggtaccac 
ccctttccct 
aaaataaatc 
agttagacat 
actcttaaaa 
ccatagatta 
tagtacattc 
agactccctg 
ttacannnnn 
nnnnnnnnnn 
tgagtgctac 
tttctcagct 
ttatctgcaa 
ttctcttgcc 
agagggcatc 
agtataatat 
tggtctcttc 
tacttcacct 
tataatgcac 
agttgacctt 
acctcctccc 
tatgtcgaac 
nnnnnnnnnn 
nnnnnnnnnn 
ccccaactcc 
tcttttatta 
agataaagtc 
tcaacatttg 



nnnnnnnnnn 
cttttgctga 
atttagaaag 
tttgtcgccc 
gggttcaggt 
gcatgcgtag 
gtctcaaacg 
caggcttgag 
ctatgtatac 
taagaattga 
ttgaattgga 
attaatgaga 
catttgttgg 
nnnnnnnnnn 
nnnnnnnnnn 
ttacgcttcc 
aacatatgaa 
gagataccat 
tggagaggat 
ttgccgggga 
cagactgccc 
ccgttccttg 
tgacccttcc 
tcccccccga 
gggtgtccgg 
acatagcatg 
ttcccccnnn 
nnnnnnnnnn 
gcacactagt 
tcacatatcc 
ctttaatgta 
atacatacaa 
catctctcca 
aactgttcac 
tgagggtggt 
caaactcctt 
aaaaaatctg 
tgattcgctg 
tgcacaaatc 
gcaaatcttt 
nnnnnnnnnn 
nnnnnnnnnn 
gatgcacggt 
taaggagtct 
acagtgacta 
tgattgccct 
cttgtcttgt 
taaagaatgt 
attctccatt 
ttggtatctt 
tgaggaaaat 
gttccgcgaa 
cccattacct 
cccagttttc 
nnnnnnnnnn 
nnnnnnnnna 
tattagcgac 
aacaagtttg 
cgatgggtga 
gacaggccac 



nnnnnnnnnn 
accccagact 
attattcttt 
aggctggagt 
gattctcttg 
ccaatttttg 
cctgatgtca 
ccatactgcc 
tgtaagttac 
tcacttagct 
aaaggtatta 
gctggtcacc 
aaaagtcaac 
nnnnnnnnnn 
nnnnnnnnng 
ctctgggtga 
aaaaagccca 
ctcacatcag 
gtggacaaat 
aatgaggggt 
gccgccccgg 
cgcttatgct 
cgctcgcccg 
ggtgcgaggg 
tgccatgccc 
ggggccgccc 
nnnnnnnnnn 
nnnnnnnnnn 
acttctctna 
aatnntgtct 
gccaaaacag 
acagagcctg 
gttattcagg 
atctaatcca 
cgaaaggcag 
gttttaactc 
aaatttgact 
actcctaatt 
atcagggcaa 
cagttgcact 
nnnnnnnnnn 
nnnnnnnnnn 
nagnagaaaa 
tttggctgag 
tttgacttcc 
ggccagaact 
gccggttttc 
tttacctttt 
tttaggcctt 
ttgtgagact 
gtgaaggccc 
gtgattgttt 
gtctcccctt 
cctcgcccaa 
nnnnnnnnnn 
acactatttg 
agtagtagaa 
cgtccaaata 
cttgtcagtt 
tgactaaaac 



nnnnnnnnnn 
tgctgctgan 
tttgttttgt 
gcagtggcac 
catcaccctc 
tattttttaa 
ggtgatctgc 
cgctgtgaaa 
gatgtttatt 
tggaaacaat 
tgtcttctta 
acgttgaaaa 
taggtcactc 
nnnnnnnnnn 
ttctgnnncg 
agagaactac 
tcatcactgg 
ttagaattgc 
aggaacgggn 
gcccgggagg 
ttgtggctcg 
acggcggtgc 
gggttccgta 
caggtcccct 
cccccgcccc 
agtcccctgg 
nnnnnnnnnn 
nnnnnnnnnn 
nagnatcccc 
actcaacnag 
ttatttgaac 
ttcttcctca 
taaaaagccc 
ttagaaagat 
tgagttattc 
actcctccct 
atttaccatc 
gctttttcta 
ttttttttaa 
taaaatcaaa 
nnnnnnnnnn 
nnnnnnnnnn 
cctctgtctc 
acgacagggt 
tctcttccta 
tccaatacta 
aaagggaatg 
tctgtcttgc 
tacatgttag 
ctactcttag 
ctacgttccc 
gccaatttta 
gatccgttat 
atgcaccccn 
nnnnnnnnnn 
ctagaatcta 
gatggtatcg 
gctttttttt 
gggagctaaa 
ctgatggtag 



25861 ctgtcttttt ctttgagttg ctgcaaactt gtggtttggt aacaaggaga tttgtgggaa 
25921 ctaattgtct atttaaaatg tcaggtaaca aatataagaa tatccaggct gtaactggag 
25981 ttcttacatt tgcttagcaa cagaactgca attcagacta atattaaata gagatttttc 
26041 ctattggaat actgcttttc ctagtataac ttttttccct tttttagaga attatttata 
26101 aaggcaataa acaggcacag attgccttgc tggttaatac aattttatta aaatacttga 
26161 aaggatagtt cctataaaca tgttataata caagttctta gagatggtca cttttttatt 
26221 ttatttaagc tatagccata acaggtttat aatgggcttt aaaatagctt gacatttcat 
26281 ctttaatgag taatttccct ttnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
2 6341 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
26401 nnggcaacta tctgtgctga acctcgcagc cttactctga naggaacccc ctntttgtac 
26461 attttacttt ttcattaatt agtcttttat agtatttaat agctccatag cataccactg 
26521 aatacatgta ctataatttg ttgagtagtc cactaagtta cacatttaga ttttttccta 
26581 ctttttgatt actattaaga agaatataat taatattttt acacagtctt gattctttct 
26641 ttaggataaa ttctaagaag tacaattgca tgggctaaca gctcattttt aaagccttta 
26701 atatgctttg gcaaattgtt ctccagaact gaattaattt tatgctaatc accagtaatg 
26761 agagtgacca ttttctcaca cacaacaata ttggtcagta tcattgctgg aaacaaaacc 
26821 caagtaaaaa caaagaaaaa cacaattatt ttgatagtat ttatatttgt tagtcactag 
26881 tcatctgacc atttttttct ccttgggatt attggccatt catatttctt ttgtaagttg 
26941 tgtttataat atttgcctag ttttatagaa tggtatttat cttactgatt tgtgagtttg 
27001 tcatatggca tttgctaagt tgcagataat tttctgtcag ttttagttta ccctttcttt 
27061 ttgtttatta tgttctttac ataccatagt ttctatcttt aggtaagcnn nnnnnnnnnn 
27121 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
27181 nnnnnnnnnn nnnnnnnnnn nnnnnnnnaa ctatttgctg aaatcgagtc ctgctgctna 
27241 nagnataccc ctcaaaaana aaatgccgat acttcattgg cattttcttt tatttctcaa 
27301 caagatggtg aattgactag tgggtagatc cacaggacgg gtgtggtcgc catgatcgcg 
27361 tagtcgatag tggctccaag tagcgaagcg agcaggactg ggcggcggcc aaagcggtcg 
27421 gacagtgctc cgagaacggg tgcgcataga aattgcatca acgcatatag cgctagcagc 
27481 acgccatagt gactggcgat gctgtcggaa tggacgatat cccgcaagag gcccggcagt 
27541 accggcataa ccaagcctat gcctacagca tccagggtga cggtgccgag gatgacgatg 
27 601 agcgcattgt tagatttcat acacggtgcc tgactgcgtt agcaatttaa ctgtgataaa 
27 661 ctaccgcatt aaagcttatc gatgataagc tgtcaaacat gagaattgat ccggaaccct 
27721 taatataact tcgtataatg tatgctatac gaagttatta ggtccctcga ctatagggtc 
27781 accgtcgaca gcgacacact tgcatcggat gcagcccggt taacgtgccg gcacggcctg 
27841 ggtaaccagg tattttgtcc acataaccgt gcgcaaaatg ttgtggataa gcaggacaca 
27 901 gcagcaatcc nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
27961 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn aaacctacgg 
28021 tnctgaatgc cagatcncgc tanagataaa ctttaacctt acccccaccc cgtgctctct 
28081 gaaacatgtg ctgtgtcaac tcagggttaa atggattaag ggcggtgcaa gatgtgcttt 
28141 gttaaacaga tgcttgaagg cagcatgctc attaagagtc atcaccactc cctaatctca 
28201 agtacccagg gacacaaaca ctgttttagg ccgcagggtc ctctgcctag gaaaaccaga 
28261 gacctttgtt cacatgagta cctggtgccc ttatgtcacc tgttgacctg tagccactca 
28321 cagatacccc acgtgaagcc gccacacccc gcccgagccg tgaccccgtg ccggtgctgg 
28381 tgtcccgctg tccccccacc gtccccccgt cccgtcccgt ccccagcccc tttgccgtcg 
28441 ccgtagccgt tttagcacat cccagcaccc cccggtagtc cccctgttcc gctcgccttg 
28501 gcccccttac tctcccccgc cccccccgtc gtgccccgtt cccccgctat atcgcacccg 
28561 gccccctgag ccgccccccc tatcggtttc ccccccttgg acatccggtt ccctctcccc 
28621 ccgcccccgc ccccccttcc tttcgcccca tcttccattt cgccataccg ccccctgccc 
28681 ccctgatcgg cccccttctt tcccatcccc ntttttcccg nnnnnnnnnn nnnnnnnnnn 
287 41 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
28801 nnnnnnnnnn nnnnnnnnnn gcacttcttg ctgacctagt tttcttagag accccaataa 
28861 ttgaattggt ttgaaatgta atatttacaa cttacacatt tttttttggt tgtgatggtg 
28921 atggcgtttt tgatggtctt cctcgtcctt tctgtggtgt ggactgtgtg gattcaattg 
28981 cactagattc aggagattct gctctgcttt ggggaagacc caaatgtgga tagcatcaaa 
29041 ttatgattta taatatagta acgaaacaga tgcagtttag agttacacat gcttacctct 
29101 gctgtgctct ccttgacact cggtgctgtt tgcttttgga cttctgttcc gtatttccac 
29161 tttgtctttc ttcttcttcc tcctcctctg gtgctggagg tccagatttt tttttgcttc 
29221 cttttttaga agttttcatt gttggctctt cttttggtgt acctccacca agaggttttg 
29281 gtggtcggcc tcttttaccc ttttttggcg gactattctg ttcatcttca ttttctaata 
29341 tatcttcttt gagcctcttt tcctcaggcc actgctgttc atcagattct gaagccgtat 
29401 ggcctctntt ccgacttcgc tgactgcctt taggtttctg ttcctgtacc aacttagttc 



29461 
29521 
29581 
29641 
29701 
29761 
29821 
29881 
29941 
30001 
30061 
30121 
30181 
30241 
30301 
30361 
30421 
30481 
30541 
30601 
30661 
30721 
30781 
30841 
30901 
30961 
31021 
31081 
31141 
31201 
31261 
31321 
31381 
31441 
31501 
31561 
31621 
31681 
31741 
31801 
31861 
31921 
31981 
32041 
32101 
32161 
32221 
32281 
32341 
32401 
32461 
32521 
32581 
32641 
32701 
32761 
32821 
32881 
32941 
33001 



agtcatccat 
nnnnnnnnnn 
nnnngcacat 
tatacacttg 
caaaataaaa 
ctataagaca 
caaggcagat 
ttttccaaat 
aaaaaagata 
cagaaaaatg 
tcagaatctg 
ccccccgggt 
tttattttgt 
cggtcccggc 
gttgtgcgca 
nnnnnnnnnn 
nnnnnnnnga 
caaaacaacc 
taatttgtct 
taacatgttt 
ttttattact 
cttgggttaa 
ttgaggtttc 
tccacatttg 
ttactctata 
tggaatattt 
ccccctgcta 
aatcacggta 
nnnnnnnnnn 
nnnnnnnnnn 
tcaagctctg 
gtttcaacat 
tagtagaccg 
tctaatggag 
cgggtgagtg 
ggatgacatt 
acttgatgga 
gaaatactac 
agcaaggaag 
cagagatgtg 
tcattccaat 
attttaaaag 
nnnnnnnnnn 
cgttgctgac 
taagcatctg 
tgaacaaatg 
tttttttttt 
gaagagtgga 
gccatacaca 
gtagacataa 
tgaagtgctt 
gatctattga 
tgtatttgtc 
ccctgtgttg 
gtgcttcccc 
nnnnnnnnnn 
ctagccccca 
ataatccagt 
atatagatga 
tactctttat 



acctaatttc 
nnnnnnnnnn 
acgtgctgaa 
caagtataaa 
tgagagagaa 
gtatcaagta 
ggagagaata 
ctgattaaaa 
aaccatttca 
ttaaaagcag 
caatgcggaa 
cctccgctgg 
tcccggtgtg 
agggaccgtt 
cgaccgagtg 
nnnnnnnnnn 
cctagtgcag 
ttattcacat 
tgttttatta 
cataatacaa 
cttcaatcct 
tctgtaacag 
taagcattat 
tttctaagaa 
atttcactgt 
ccatttcttt 
catatgagca 
tatagacata 
nnnnnnnnnn 
nacacttttg 
tcactaactt 
ttctaaaata 
tgtgctggac 
gggcagacaa 
ccatgcagaa 
ttaaatcaga 
gatgagaaca 
atacaagcac 
cctgggggct 
acggcgacca 
gttgatgcat 
gnnnnnnnnn 
nnnnnnnnnn 
ntccccgtgc 
gtgactttcc 
taattaataa 
gggcaagctt 
cagttggacc 
tggtaacatt 
agaagaaact 
gtatagcttt 
tttgaaaaga 
ttgtgtggaa 
atggcacctt 
nnnnnnnnnn 
nnnnnnnnnn 
gtcttnactc 
aaacaaaata 
ggtcagaaga 
tattattttt 



tcctnnnnnn 
nnnnnnnnnn 
ctgcacgcat 
agtatcagtg 
aaaagggcaa 
gtcaaaaata 
gataaaaaaa 
taaccttaga 
aggcatatta 
tgcaaaaaca 
catgtgcccg 
tgcccccggt 
gtcccgtgtg 
ggcaggttgg 
ttgcgggtnn 
nnnnnnnnnn 
aatctcattt 
acttgtcagg 
tatatttgat 
tgaacaccta 
ttctacttcc 
caacaaaatt 
atcataccct 
taatcttcgt 
ggaaaatatc 
actctttcgg 
agaatttctc 
tttaactttt 
nnnnnnnnnn 
ctgaatccca 
attgtgtaac 
ggattctttt 
cctgggattg 
taagcaaaat 
aagtaaagca 
aagtcaggaa 
agactcacct 
tatgacaagg 
gaagcacagc 
ggtcatacgg 
tgnggaattt 
nnnnnnnnnn 
nnnnnnnnnn 
ggctnctnan 
ttttaaggta 
ctttctctgt 
gaggctgaat 
ttactttggt 
gactatggag 
tggaaatatc 
tatctgcggc 
atttgttagg 
ttttgctgtg 
ttttctagag 
nnnnnnnnnn 
nnnnnnnnnn 
tanagnatcc 
catccttatt 
ttgactgtca 
ttctccagct 



nnnnnnnnnn 
nnnnnnnnnn 
gagtgtngng 
ggaacttgaa 
ngaaaaaaat 
ctntgtaatn 
aaaagtattc 
gatccaagaa 
ttattcattt 
gtggaacaag 
tgcccccttt 
agtcatgccg 
ccctgccccg 
ccgccgccgt 
nnnnnnnnnn 
nnnnnnnnnn 
ggctctnana 
aaacataact 
atgtacaaag 
tgaactctta 
tccttgtccc 
taccacatac 
gtatagtctt 
tactaaatct 
tcgtaattta 
acagtgctgc 
taggttttat 
cnnnnnnnnn 
nnnnnnnnnn 
gtactgtact 
ctggaaaaat 
atttgtattg 
caacagagag 
gtgaacatct 
taatacagta 
ggccgtgttg 
acacctgggt 
aaggaaactt 
ctgtgagggg 
cctgtgagac 
taagcaaaag 
nnnnnnnnnn 
nnnnnnnnnn 
agaatcccct 
cggcggcgaa 
gaaagctttg 
aaagcctttg 
gaccccatac 
tcttgtgaaa 
ttttttcttt 
tttaaactga 
atagatctta 
aaaaaaaatt 
aaggggtttc 
nnnnnnnnnn 
nnnnnnnnnn 
cccnacagag 
ctctcaaaca 
ccaagtacct 
ttattgaggt 



nnnnnnnnnn 
nnnnnnnnnn 
gaaaacactc 
nacacagcaa 
agtaagacag 
ntggagcccc 
aaaataataa 
tctcaataaa 
ggataaaaat 
gattgataga 
tgatcgtttg 
gggcgtgagt 
gccgccgtgc 
gcggcaggag 
nnnnnnnnnn 
nnnnnnnnnn 
gaaaacactg 
gctcaccatt 
gaatatatgt 
gttccttaga 
taatatctac 
ttgtttagtt 
ggaggatttg 
aactgtggtt 
tttattcttc 
tatgaatgtt 
tcctacgagt 
nnnnnnnnnn 
nnnnnnnnnn 
ctanagnatc 
cctaactgct 
aatatgtatt 
caaaccccac 
attatagtgt 
gtaaggagtg 
agtcagaaac 
gaataacatt 
gccttgtacg 
agagtaggag 
cactgtaagg 
agacatagga 
nnnnnnnnnn 
nnnnnnnnnn 
gcacatatac 
gtgctaaaag 
gaaaaatctt 
atgcacaaaa 
atttgtggtc 
gtgtaatgtg 
tttttaatgg 
cagtacccga 
agcagtaatc 
tgttgtcaac 
acccccccat 
nnnnnnnnnn 
nnnnnnnnnn 
ttaagactaa 
gaaattaacc 
caaggcattt 
atacttgaca 



nnnnnnnnnn 
nnnnnnnnnn 
taaaagcatt 
tagaaactat 
cagctgtgat 
agtaaaacaa 
tggccaattt 
cctcaagcag 
tagtgaaaaa 
acacttttga 
ctgtcccccg 
tcgcccgccc 
gccccgagac 
tgtcggcccc 
nnnnnnnnnn 
nnnnnnnnnn 
taaattggtt 
tttaatccat 
aacatttgta 
gttcttagag 
tctcttgaat 
ttgcttgttc 
cttttatcac 
cattcatttg 
tagtagattg 
tctattgaag 
gaagttgatg 
nnnnnnnnnn 
nnnnnnnnnn 
cccctctact 
gtgttctcca 
gactgtcttc 
taagccatat 
gtaccagaag 
ctggggtagg 
atctgagtag 
ccaggaagag 
ttagaggggc 
gcgataaagt 
accttggctt 
cctgaattcc 
nnnnnnnnnn 
nagaactaga 
ccattttaat 
ggaacggcga 
tttttttttt 
tgggactgct 
acatgcttta 
cgatggctat 
ttctgatttc 
ctgtttattg 
tgtcactgtt 
aatcggtgcc 
tcgctccccc 
nnnnnnnnnn 
acactttttg 
tgctggcact 
tgggttgtgg 
ccaagggtct 
aataaaaagt 



33061 gtatacattt gaagtgtaca atgtgatatt ttgatataca tatatattgt gaaatggttg 

33121 ccacaatcaa gctaacatat ccacacctca cctagttact cgtgtgtgtg tgtggtgaca 

33181 acacttgatc tctgctctct tagcaaattt cagtatatat aataacagta ttattaacca 

33241 tagtcaccat gctgtttaga aggtatctat cccacataac tgaaactttg taccctttga 

33301 ccagtgtctt cctactcctc tctcatccat cccctggcaa ccatcattct actctctgct 

33361 tctctcagtt tgacctccta gattccacat ataagcgaga tcatacagta ttcttctttc 

33421 tgggtctggc ttgtttcatt taacgtaatg tcctccagct tcatccatgt tgttgcaaat 

334 81 ggcaggattt ccttctcttt tgnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

33541 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

33601 nnagactagc gtgcaagagt ctnagctcgg ctctanagaa tacccgtaaa ccagagacct 

33661 ttgttcactt gtttatctgc tgaccttccc tccactattg tcctatgacc ctgccaaatc 

33721 cccctctgcg agaaacaccc aagaatgatc aataaaaaaa aaaaaaaata aataaatcaa 

33781 taaattaaaa aaaaaaaaaa agcaccaatg taagttgtgt cttgagggag atatttcaaa 

33841 tctataagaa accaggaatt ctaattcttt "taattgttca gagtatagaa aagaaagcac 

33901 tcattttctt tacagccagg ataactaaca tgaatactaa aacctagtaa aagatatcct 

33961 ccccacccac ccccccaaaa aaaaacgcta agacaaatct tactaatatg tgatttttgt 

34021 ataagcaaga tagtagcaaa aagagaattt aaataataaa gcgagaatat attatggcca 

34081 aacatggtta attttaggag tatacagaga gcgttggtac aatattagta agcctactat 

34141 ttatgtttac aatttattat ataagtaggg aaaagggtat acaatagatc atttttgata 

34201 gatgctgaaa gggcttctga attttttttt ttttgagacg gagtctcgct ctgtcgccca 

34261 ggctggaggg cagtggcatg atctcggctc actgcaagct annnnnnnnn nnnnnnnnnn 

34 321 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

34381 nnnnnnnnnn nnnnnnnnnn ngaactaacg ttgctgaaat cccagctcna ctctnanaga 

34441 atccccaatg gcaacaggta tatggggttt aaaagaaata attgaagagt gattgaaact 

34501 actaaaaaaa aaaatcaaag cctcactttt tttttttgag actgagtctt gctctatcgc 

34561 ccaggctggg gtgcagtggc gcgatctctc ggcttactgc aacctctgcc tcccaggttc 

34 621 aagcaattct cctgtctcag cctcccgggt agctgcgatt acaggtgtgc accaccacac 

34 681 ccggctaatt tttgtatttt ttagtagagg tggggtttca ccatgttggc tgggctggtc 

34741 tcaaactcct gacctcaagc aatctgcccg cctcggcctc ccaaagtgca gggattacag 

34801 gcatgagcca ccgtgcccag cctcaagcct cactctttca aacaaaagca aggattgctt 

34861 tattactctt atacttttat cactttatct agcagtgaaa acctctgaaa cctgccatca 

34 921 tatttcagct aggggagaaa gaaagaatct tctatagaat gagataccgc aagccctcat 

34981 ctgtaacaac agtgagtaga agttaaaaag caagaaatga ctgactctaa ggctactggg 

35041 aaacaggaga tgggaagcca atatgagcct aagaaggaaa cgtatacgtt ggcaggatat 

35101 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

35161 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn gacatacgtg ctgacgtcga 

35221 cctcggctct anagataccc ctgtctccca aanataactg ctcaaaataa cacaaaaggc 

35281 gcaaatacaa agctattcac tgaaagactg taatactgaa aacaacccaa tgtccataaa 

35341 taatgaactg gttaaataaa ttatgataca gccacacaat aaatggggat ctctctatat 

35401 ttatatgaga taattgccag tgcatgccat tagagagaaa aaaggtgaag aaaagtgtat 

35461 atagtttatc ttattcattt tatcaaatgt gcatacatat gcagacaggt atgtgcacgg 

35521 aagggggctn tnttttnann tnngccttaa nttgtgagtg gtgctcagtt tggagcacgc 

35581 ctgcccgcct ccattccctc ttcctgcgct tttcctgccc cgtgtccttc cccacgtcca 

35641 gtttcgccgt tccgccgcct tccgtattgc tccctttgcc gcacatcctt gtcttcctac 

35701 tcctgctcgc cccccccctt cctaccttgt ccccccctgt tgtcctccgg catgtctatg 

35761 gccggccacc attgttgttc cccaggtgcg gcgccctttc ccactttccc taacaccgtc 

35821 cccccttgtc cttcgccccc gtcttcccgt cccagtgtgc cctcccttcc ctcttcgacc 

35881 gaccccctta ttcccaccac ccatttttat ccccnnnnnn nnnnnnnnnn nnnnnnnnnn 

35941 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

36001 nnnnnnnnnn nnnnngacat ggtgcagaat ctgcatctgc tcntagagna acccatcgca 

36061 attcgagtga accccaatta tgaacatagt aaatgtagtc ttcaagttca catataattc 

36121 tgaaatccca ttttatgtct tggtgagata tttaaatgac ttaatatagt taagaatcta 

36181 aaaatatatt aatcctccta aatacttctg tatgaccttt tcctgtttct gtaatagaaa 

36241 aaaatctgca aggaaagaga accacattat agtacaacag ataaacatga tcacttatag 

36301 aagataaatg gaatgttttt ataagtttac ataataagca cctgagggca agctggaaaa 

36361 ctgctagtgc agcacttcgc aatctgtgtt ccaaagacca ctagtagttc agtggagttt 

36421 aacagtgttc aggaaaaggt attttttcaa ataattttgg aaagtgttat tttacagccc 

36481 cttcataatg gatgggcatc ataataaagg ctacaataag gcctattatt tcagaaaaaa 

36541 aaagcaaaag ggggtaggga aaagaaggga taaagaaaac tctttcaatt tagcattcta 

36601 taaacatata tgacaatgaa accccttttt ctcacaatat tattaacaac tgcaggatac 



36661 cagcattcct tgaaaacaga atctgggaaa tattgaactc actggaagan nnnnnnnnnn 
36721 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
36781 nnnnnnnnnn nnnnnnnnnn nnnnnnnnng aactagtgca gactccagat ggctctanga 
36841 gaacactatg atttgtttaa atgtactttt gtaatactga aataatctgt attacagcgt 
36901 gaaataacta agaagttggg caaccccaaa cagcctacaa atcctttcct ggaaatgatc 
36961 aagtttctct tggagaggat agcacctgtg cacatagata ccgaatctat caggtatttg 
37021 tatataaaat actaagttct cattgctgtt cattaatgtg tacattcagg aatcagtcat 
37081 tgttgagtgc ctgctatatc tagctacaca tctgtgtgta ttgtgtactc ctctgtatat 
37141 actcctctgc atatacctgt atttgtatat gcacacacac acacacagac acacacatat 
37201 atatatatac actcatctgt acctgtatat ttgttgttat tcttgttatt tatcttaatt 
37261 ttaaggattg gttcaaattg tgtgagaaat tgccaatcta agcctaattt tatctgttca 
37321 cggtatacca gacactcttt gttttcgcct acttgtatag ttaaggtaat ataaataaac 
37381 tctatttata ccctcgggaa actaggcgaa tctcccgccc ccgcacagcg gaccaccagg 
37441 ccatgtcctc cggaaactgt tcaggagacc ccaaatcagc ttaattcaac cccctgtttc 
37 501 cgaacccaag ttatcccgnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
37 561 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnag 
37 621 actatgtgct gcaacctgat gatgtangga acaaccagac acaaacaatc tgctgttctc 
37 681 actttaaatg ggaggaacat caaacacaca tgggtataca cnatggaaac aacagacaat 
37741 gcggactact gggggagctg gagaaaggag gatgtgggct gtaaaacgac ctattgggta 
37801 ctatactcac tatgtggatg caatataccc atgtaacaaa cctgcacatg taccctctgt 
37861 atctaaaata aaagttgaat ttaaaaaaaa caaaaaagca aaaacacact tttaaaaaaa 
37921 cgaagtttat atatgtacgt ggtactatac aaataaattc aaatgatacc aaaaagtata 
37981 aaataacctc cctttctata ctaggttaac ataaatactg ttaagatttt gcatattctt 
38041 caaaaaaact ttctatgcat ataaaaacaa atgtgaacat taatctgttt tcaaaactta 
38101 aagattccat aatgctgctt tatatacttt gcctttttca ctcaataatg tattgtggaa 
38161 actcttccat actaactcag acctcatcat gtttaatgac tatgtattcc attgtatcat 
38221 acaccataat acaacagttt tacaattgct acacactgag gttggaaact agtttatgtg 
38281 tactaacaca gattaccccc aacaagctaa agaatgaaag actaagacag nnnnnnnnnn 
38341 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
38401 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn aactttcgtt gcaaaaccgc tcttgctcta 
384 61 nggaataccc caccgctcan aaatctgatg gcaccctaca ggaacatgac ggtatctgcg 
38521 agatccatgt tgctaaatat gctgaaatat tcggattgac ctctgcggaa gccagtaagg 
38581 atatacggca ggcattgaag agtttcgcgg ggaaggaagt ggttttttat cgccctgaag 
38641 aggatgccgg cgatgaaaaa ggctatgaat cttttccttg gtttatcaaa cgtgcgcaca 
38701 gtccatccag agggctttac agtgtacata tcaacccata tctcattccc ttctttatcg 
38761 ggttacagaa ccggtttacg cagtttcggc ttagtgaaac aaaagaaatc accaatccgt 
38821 atgccatgcg tttatacgaa tccctgtgtc agtatcgtaa gccggatggc tcaggcatcg 
38881 tctctctgaa aatcgactgg atcatagagc gttaccagct gcctcaaagt taccagcgta 
38941 tgcctgactt ccgccgccgc ttcctgcagg tctgtgttaa tgagatcaac agcagaactc 
39001 caatgcgcct ctcatacatt gagaaaaaga aaggccgcca gacgactcat atcgtatttt 
39061 ccttccgcga tatcacttcc atgacgacag gatagtctga gggttatctn nnnnnnnnnn 
39121 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
39181 nnnnnnnnnn nnnnnnnnnn nnnnnnnnna actttttgct gaatctnatc ttactctang 
39241 agaaccccaa gtcaggagat cgagaccatc ctggctaaca cgngtgaaac ccagtctcta 
39301 ttaaaaatac aaaaaattag ccaggcgtgg tggcaggcac ctgtagtccc agctacatgg 
39361 gaggctgagg caggaggatg gcctgaaccc gggaggtgga gcttgcagtg agctgagatg 
39421 gcaccactgc attccagcct gggcgacaga gcgagactct tgtctccaaa aaaaaaaaaa 
39481 aaaaaaaaag atttaaaggc cctgatttta tattctttgt gttgaaatat ttgcaatggt 
39541 atgaccactt tctttacatt ttaatgaaaa ctgtcagtga ttactgtaaa tttaatattt 
39601 tggtctcatt tttctgattt gttaaaagag gatataagtt gtgataagtg tttgttaagc 
39661 ttcagatctc cagtctgatt tcttgattct tatttagctc aataattacg ttcagtcagt 
39721 caagttaagc caactttttt cttttaaata caatggaaag tttttagaca gtaggaaaga 
39781 atatcagttt aaattatgca acttaattta aagtttaata ttttaaaaca aaaatacttt 
39841 tactggatat atatttatta tatctttaaa ttaagacttg gattagctgg ggatgtattt 
39901 tattgggttt tannnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 
39961 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnaacacttc 
40021 ttgctgacac cgnatctttg actctanaga atcccctctc ctattcaact attcttcatt 
40081 tacaacaata gtgacaccag taattatgag aggctgtcat atatcaaagc agctcaggtg 
40141 ggatcaacat acattggttt tgaaatgttt ccttagctcg gttctcccaa acacataatt 
40201 tctgtttcag cagtacaaag gcatgtttta aaatctataa cataaagaat aaggaatagc 



40261 tttgtatttt tgtcattgat taaattgcac cagaaatgtt tatggaaata ttaagaacat 

40321 tttataaaac atacgaaaaa gtgattgtga aggattttta tttgcatgat tatagttaag 

40381 caaatttcat ttcacatttt tgtaatgctg tacagcagtt cacaaaaaaa tgttcattgt 

40441 agattttgtt atgttcaatg ccaatgagtc tgtaatttta cgacctgtct gttctttttt 

40501 tgggtggatt acgatgtaaa tttttctttt ctttctttct tttttttttt ttttgngtag 

40561 aaaaataggt gcaataatga tcaaagtttt gatgtcttga gtctccatct taggggatta 

40621 tcttacgttt aagcttaaca tttcatgtag taagatttgg agagccacat actttacagt 

40681 aaaattaagt gtgtataaaa cattaattgc tannnnnnnn nnnnnnnnnn nnnnnnnnnn 

4 0741 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

40801 nnnnnnnnnn nnngactaga tgttgctaaa aatctcacac tcggactctn anagaatacc 

40861 cctattgtac aagtctgcaa aactagtatc ataacctcca ctagacatgt gaaaggtagt 

40921 tcgtaatacg aactataaac aaatacacac acacacacac acacacacac acacacacac 

40981 acacacagac tgaaagggtt tataagaaat tacagctaga aaataacaat atttaatgta 

41041 tgtatttttg gccttcatat cttggtagtg gattgacatg aggtctacac tacaggccat 

41101 agatggtttt aataaatcaa gagtctatac caatgatgag caatcttata aaactcctaa 

41161 ttcaataaat tataggctga attttttaaa aataaaaagt tttattaaat cccatggaaa 

41221 agatgaatag gcttaaaatc tgaattatca tctttacaga acagaggaaa gtccataaaa 

41281 actagcactg ccttttgcca tcagggaaac agctttcaaa cctttttttc tcctcaaatt 

41341 agattatgtg cagctgtcca caaaatccca gctttctcat tattacccta tagtctcaca 

41401 gcacataaga catcatacaa taaatgtaaa tacttgagtg actttaatat ctttccaata 

41461 gaaatagaaa ttttaatacc tcaaatatta caatgtggtg attatattaa aatnnnnnnn 

41521 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

41581 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnaactag gtgctgactc cggtcggctt 

41641 gngagacact ttgccttatt ttagtttaat tatttaactt ttaattggct atgtaatgtt 

41701 ttgaaaacct ggtttttaag aatgatgtaa ataagttcaa gagaggaaag ncaaaaacct 

41761 aacagactct agttataaaa aaattgaaac attattcgct aataaaacca ccagtgtagc 

41821 tctccctctc cctctcccta tccctctccc tctccccacg gncnccctct ccctctcttt 

41881 tctctccctt tctttttctc ttgtccctnn cttttctctt cttttccctt ctttnctctt 

41941 tcccctcttc ctcccttttt ttccttnttc tccttttccc cctcccttcc cctctcntct 

42001 ccttcccccc ccctctnccc ttccctttct ccnctttccc cccctcttcc ttttcctttt 

42061 tctccctttt ccctcctcct tttccccttt cccctcctcc ctccccccct cccccccttn 

42121 ttcccccttt ttcttccctc ctctcttttt cctccccctt tctttttccc cctctttctt 

42181 tcctnttccc ctttcccctt ccccgccgcc cgttcccttc gccccctttg ccgttgccgg 

42241 tggcccgccg gggcccggcc ggctgccgct ctggcgccgc cccgcttttt tcccnnnnnn 

4 2301 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

42361 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnggacc tacgggtctn gagnggcggg 

42421 acttcagtct ngnggatgac actttccaca atactcacaa atcagtaata taaataccat 

42481 tctataaaac taggcaaata ttataaacac aacttacaaa agaaatatga atggccaata 

42541 atcccaagga gaaaaaaatg gtcagatgac tagtgactaa caaatataaa tactatcaaa 

42601 ataattgtgt ttttctttgt ttttacttgg gntnnggntn nttttttgtt nggtggtgtt 

42661 gtgtttttnt gnggngttgt gggtttttgg tgtnttgttt ggggtgtttt gtgtnttggt 

42721 nttggttgtg gggtttttgt gggtgggggt ttggtgggng gggtggtggt tgtggggggt 

42781 tggtttgggg ttgggttttt gtggtgtggg tggtggtgtt tggtgtttgt tgggtgtttg 

42841 gggggtgtgt ttggtgggtt gttttttttt tgtggtgttg gttgtttgtt ggggttntgg 

42901 gggtgggggg gtggtggggt gtgggttgtt gtgggtgggt ggggggtttt tgttttgttg 

42961 ttgtgtgggt ggttgtgggg tgtgttgggg gtttggggtg gtttttgggg gtttcggtgt 

43021 gttgtgggtg ggttggggtt tttttgtgtg ggtggggggg tttgtgggtt tggtgggtgt 

4 3081 ggcggttgtg cccnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn 

4 3141 nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnnnnnnnn nnnngaacta 

43201 cgtgctgaaa actagatttc tgtanggnaa acccaattca cttgtgtctc ataatactca 

43261 tctctctgag ggcacttcag tcaaaactag cttattaata agattcttct agcacctcta 

43321 aaactggacc taacgttaca cattatccat gccacaacaa atagcaaaag gcctccatta 

43381 caatcatact aataggtctc aagagctcaa aaattcaaaa cagttctatg aaaaatagag 

43441 tagactttgt tctgtactat cccaaaaggt aaaagcagat gcaaaattta taggataaca 

43501 gaatattaag aattaataaa aaccatgttt cccaactttt gggggatggg aacaccaagg 

43561 tataagtgta gtacactata ggtgtgggta tgagtgacct agttgacttt tccaacaaat 

43621 gaaatacatc atgagctctc tgcttaaagc attttcaacg tggtgaccag ctctcattaa 

43681 taataaagga atttctgcag taagtgagaa ctaagaagac ataatacctt ttccaattca 

43741 aggattttct tctgtaaatc cagataacaa gattgtttcc aagctaagtg atcaattctt 

43801 aattttgctc atgtgaaaat tttaaacagc caataaacat cgtaacttac agtatacata 



43861 
43921 
43981 
44041 
44101 
44161 
44221 
44281 
44341 
44401 
44461 
44521 
44581 
44641 
44701 
44761 
44821 
44881 
44941 
45001 
45061 
45121 
45181 
45241 
45301 
45361 
45421 
45481 
45541 
45601 
45661 
45721 
45781 
45841 
45901 
45961 
46021 
46081 
46141 
46201 
46261 
46321 
46381 
46441 
46501 
46561 
46621 
46681 
46741 
46801 
46861 
46921 
46981 
47041 
47101 
47161 
47221 
47281 
47341 
47401 



gttagaattc 
nnnnnnnnnn 
nnnnnnnnnn 
tgtngnggnt 
tacagtgtac 
acgcagtttc 
gaatccctgt 
tggatcatag 
cgcttcctgc 
attgagaaaa 
tccatgacga 
tttgttctga 
gattttctca 
cacagttgat 
tcatcattga 
nnnnnnnnnn 
nnnnnnnnnn 
ccatttgctt 
gtttacctcc 
ctcatcattt 
ttttcataat 
gcagcaacag 
aggataatgt 
ttcatatttt 
caatatctaa 
taattttaaa 
ttgagtcttc 
tcttcataag 
nnnnnnnnnn 
nnnnnnnnnn 
tgctganagn 
tgaacataaa 
cctatgctga 
ctaccttcaa 
tacattcaaa 
gaaacaatcc 
acagacatta 
acaatgtgtg 
catagctctc 
cattttagat 
ctggctgttt 
tcctgaagct 
nnnnnnnnnn 
tgacactcta 
ttctttgcct 
atgtagccag 
agtagcaaat 
gtgccccacg 
ttgtatttta 
aattcactat 
aaaatgtctt 
tcccactgag 
agaataggct 
tatttccatt 
tttttttttc 
nnnnnnnnnn 
nnnnnnnnnn 
ggtaacattt 
acttataaac 
agaacaggtg 



aataaaattt 
nnnnnnnnnn 
nnnnnnnnnn 
aacactctcc 
atatcaaccc 
ggcttagtga 
gtcagtatcg 
agcgttacca 
aggtctgtgt 
agaaaggccg 
caggatagtc 
cctactgagg 
tactttttga 
ttccttctct 
tgagggttga 
nnnnnnnnnn 
nnnnnnnnnn 
tgactcctgt 
tgattatccc 
ctttgcctct 
gtagccagat 
tagcaaataa 
gccccacgaa 
gtattttatt 
ttcactatat 
aatgtcttgg 
ccactgagat 
aataggcttt 
nnnnnnnnnn 
nnnnnnnnnn 
atcccctctt 
gaaaaatttt 
ctgcttttcc 
gttcaagttt 
ctgaaaaata 
tgtcaacatt 
ataagtaagg 
tagaacctca 
agtgtatgta 
ttaaaaattt 
acatactctc 
nnnnnnnnnn 
nnnnnnnnnn 
gtctttctct 
ctttgttggt 
attgtgaagt 
aatataaacc 
aagctctgca 
tttcattgat 
attgataagt 
ggtcattcct 
atagattttt 
ttgcaccttt 
attttctaat 
ttttgagaca 
nnnnnnnnnn 
nnnnnnnnta 
tttggccagc 
gggcggtaac 
aaaaggcata 



tagaataatt 
nnnnnnnnnn 
nnnnnnnnnn 
ttggtttatc 
atatctcatt 
aacaaaagaa 
taagccggat 
gctgcctcaa 
taatgagatc 
ccagacgact 
tgagggttat 
gtaatttgtc 
actgtaattt 
ttcccttcgt 
ttatcacagt 
nnnnnnnnnn 
aactctatct 
tccttacacc 
ttgagttttt 
ttgttggtct 
tgtgaagtac 
tataaacctt 
gctctgcatc 
tcattgattt 
tgataagttt 
tcattccttg 
agattttttt 
gcacctttta 
nnnnnnnnnn 
nnnnnnnnnn 
ttacatactc 
aagaattctg 
tattaaaaag 
tttccttgat 
catgaaaccc 
ttggtgaaca 
atataaaata 
tttgtcaaat 
tagtgtatgg 
ttaaaaacac 
tctgagcttt 
nnnnnnnnnn 
nnnnnnnnnn 
nanagnaacc 
ctcacattta 
accatgattt 
ttatatctct 
tcatgggtca 
ttctttttct 
ttttatattt 
tggctcttct 
ttaaggcctt 
taaaatgttt 
tggtatagct 
gtcttgctct 
nnnnnnnnnn 
cttcttgcta 
actttgaaat 
ttttttactt 
ggctctggaa 



ctttcacagc 
nnnnnnnnnn 
nnnnngaact 
aaacgtgcgc 
cccttcttta 
atcaccaatc 
ggctcaggca 
agttaccagc 
aacagcagaa 
catatcgtat 
ctgtcacaga 
acagttttgc 
ttaaggaagc 
catgtgacct 
ttattactct 
nnnnnnnnnn 
tgctgacacc 
tgtatccagt 
ctactttttt 
cacatttact 
catgatttta 
atatctcttc 
atgggtcact 
ctttttctaa 
ttatatttgg 
gctcttcttg 
aaggccttcc 
aaatgtttnt 
nnnnnnnnnn 
nnnaacacat 
agttatttta 
cagtactaag 
aatttcttat 
aacaatcata 
taaagaaaaa 
ctcctaaaat 
cacagtattt 
attgctaagc 
ctataccaaa 
cttaacaaca 
aaacctatcc 
nnnnnnnnnn 
nnnnnnnnnn 
cccaccgcgt 
ctcttgcccc 
tatctacccc 
tcaataattt 
ctcttgttta 
aatttttatc 
ggtccaaaaa 
tgttattttc 
cctttcaata 
tatgggtttt 
gctatgcttt 
gtgcagtnnn 
nnnnnnnnnn 
gaaacccagt 
ggccctctgt 
tggaatgacc 
tcagaaaaag 



gggcagtatg 
nnnnnnnnnn 
atgtgctgaa 
acagtccatc 
tcgggttaca 
cgtatgccat 
tcgtctctct 
gtatgcctga 
ctccaatgcg 
tttccttccg 
tttgagggtg 
tgtttccttc 
caaatttgag 
gatatcgggg 
nnnnnnnnnn 
nnnnnnnnnn 
cagtcttctc 
ctgtcatcaa 
ccaccggtct 
cttgccccac 
tctacccccc 
aataatttac 
cttgtttagc 
tttttatctt 
tccaaaaatt 
ttattttcgc 
tttcaataaa 
atgggtttta 
nnnnnnnnnn 
ttgctagcac 
aaagaaaaca 
tcaatgaaaa 
tccctcatta 
ccaatataga 
actcagcatc 
ttctctagac 
ggaaacattt 
actattcttt 
actaatctca 
ataccatgaa 
tttcataatc 
nnnnnnnnnn 
nnnnnnnnnn 
ctcccttaag 
acctcaaatt 
ccagcagcag 
accatcgctt 
gctaattctt 
ttttgcctgt 
ttctcaaagt 
gcttattcag 
aagtttttat 
attattattg 
tctttctttc 
nnnnnnnnnn 
nnnnnnnnnn 
ttgctctana 
gaagaacggt 
tgcaaggagc 
cacgtttagc 



gctcnnnnnn 
nnnnnnnnnn 
acctgatttc 
cagagggctt 
gaaccggttt 
gcgtttatac 
gaaaatcgac 
cttccgccgc 
cctctcatac 
cgatatcact 
gttcgtcaca 
agcctgcatg 
ggcagtttgt 
ggttagttcg 
nnnnnnnnnn 
nnnnnnnnnn 
tanagnaccc 
atcctgttga 
cccttaagtt 
ctcaaattca 
agcagcagca 
catcgctttt 
taattcttcc 
ttgcctgtac 
ctcaaagtct 
ttattcagta 
gtttttattt 
ttannnnnnn 
nnnnnnnnnn 
gcagtcttgc 
actgaaacaa 
taaataacca 
aattgacttt 
aaatgaaaaa 
tcactaccct 
atattattat 
ttttcaacca 
ttaacagtca 
tactgataag 
tatcGctgtg 
tgctcagaga 
nnnnnnnnnn 
acacttttgc 
ttctcatcat 
cattttcata 
cagcagcaac 
ttaggataat 
ccttcatatt 
accaatatct 
cttaatttta 
tattgagtct 
tttcttcata 
cttacattgc 
ttttctttnt 
nnnnnnnnnn 
nnnnnnnnnn 
gaacccccaa 
tctctagcat 
tgcatcttcc 
tgcaggctct 



47461 
47521 
47581 
47641 
47701 
47761 
47821 
47881 
47941 
48001 
48061 
48121 
48181 
48241 
48301 
48361 
48421 
48481 
48541 
48601 
48661 
48721 
48781 
48841 
48901 
48961 
49021 
49081 
49141 
49201 
49261 
49321 
49381 
49441 
49501 
49561 
49621 
49681 
49741 
49801 
49861 
49921 
49981 
50041 
50101 
50161 
50221 
50281 
50341 
50401 
50461 
50521 
50581 
50641 
50701 
50761 
50821 
50881 
50941 
51001 



taacacaggg 
gcaagcacag 
gggaaagatt 
taccctgatt 
ggcctaacgg 
ttccctggcc 
gcctctgggc 
agctcttcat 
nnnnnnnnnn 
nnnnnnnnnn 
cctaacatat 
ttgtttaaaa 
aaaataacca 
tctaatttaa 
aatattatgc 
tgtcaagtct 
tttactgtga 
ctatttaaga 
aacttttttt 
ttctgtatat 
ggcaaaagaa 
nnnnnnnnnn 
nnnnnnnnnn 
gtngnagaat 
tttttaataa 
taattctcta 
tctctattta 
ttacagcctg 
caaatctcct 
tcaggtttta 
tcccaactga 
aggctatttg 
catcttctac 
actgttctct 
attcacatan 
nnncinnnnnn 
cagaaaaccn 
cacattagta 
ttgattaatc 
gctttaaacc 
gaaacagttt 
aacaacaaca 
tctttaaggg 
ggagcctgaa 
tttacaagtg 
aagactaaaa 
caaatggcta 
aggaagagac 
nnnnnnnnnn 
nnnnnnnnnn 
ctctnanaga 
accaccgccc 
aaacgtttca 
gctgtcttct 
gctcccctgt 
aatcatgcat 
aaaaaaggac 
agaattatca 
ccgtctgagc 
acatacacac 



ggccatattc 
ccatagcctt 
taagacttaa 
tcctcagtgg 
ccctgcacaa 
cccagctcca 
tattgcacct 
atgcctgcct 
nnnnnnnnnn 
nnnnnnnnnn 
tttgactggt 
tatttacaat 
ctgtacaaaa 
aaaaaaaaga 
aatccaaaga 
ccatcactat 
ttatttttca 
gacaagtcga 
aaaaataaaa 
atgaagataa 
tgatttgact 
nnnnnnnnnn 
nnnnnnnnnn 
aacacttatc 
aattgtatta 
aaaaagggaa 
atattagtct 
gatattctta 
tgttaccaaa 
gtcagtggcc 
caagtcaccc 
gacgcaaact 
tactgtcgct 
ttttcaataa 
nnnnnnnnnn 
nnnnnnnnnn 
agactgacgg 
taattttatt 
tcaaggaaag 
ccaccatatc 
gtcaaattgg 
gaaagacaac 
tgttgctcta 
actcctgggc 
tgagccacca 
atggaaagga 
cactattctc 
atatttcata 
nnnnnnnnnn 
nnnnnnnnnn 
ataccccttt 
tggggctgtg 
caagcgcctc 
ccccggggcg 
ctcactgtgc 
ttcctaagag 
cttagagttc 
gtagacagaa 
ccccactacc 
acacatatac 



atctctgtat 
ttaacatgtt 
tatttttaaa 
cttcccactg 
cctggtactt 
gttatagtct 
gtgattccct 
gtcacccttc 
nnnnnnnnnn 
nnnnnnngac 
cactcacact 
gtataacagt 
gcatattcca 
aaaagaaaca 
actgtgaaat 
gcaatattgt 
ttccaagtag 
caaataaaac 
tgtatttaat 
tctcgaatgt 
aaagactagt 
nnnnnnnnnn 
nnnnnnnnna 
acttgtatta 
accagcaagg 
aaaagttata 
gaattgcagt 
tatttgttac 
ccacaagttt 
tgtccaaatg 
atcggactct 
tgtttaataa 
aataggagtt 
tcaagatatt 
nnnnnnnnnn 
nnnnnnnnnn 
tnanagnaaa 
ggatgtagat 
cagacataaa 
aataattaca 
attcaaaaca 
taggacattt 
ttgcccaggc 
tcaagtgatc 
tggccagcca 
aaaatatatg 
agacacaaaa 
acaacatcgt 
nnnnnnnnnn 
nnnnnnnnnn 
cccatcccac 
cagcttgacc 
tgccggggaa 
acagtccacg 
acggcatatg 
aaccaaaggg 
tggctttgta 
aggggtagtt 
ccactcaccc 
acgtgcacac 



tgttccagtt 
tcctggcttt 
atgcagatct 
cacttggaat 
tctactctca 
ctcttcccta 
ttgctcggaa 
agatatcagc 
nnnnnnnnnn 
tagttgcaga 
ttctacgaga 
accctgcacc 
gtaaaaatat 
gcaccattga 
agtaattgca 
tgttagcaat 
ccatcgaacc 
actggtacat 
ggatgctttt 
tttaaagatg 
ttgttctttg 
nnnnnnnnnn 
ctaacttgtg 
taacatgttt 
caatctgtgc 
ctaggaaaag 
tctgttgcta 
ctgacatttt 
gcagcaactc 
ttgaagagga 
atctcaaaaa 
aagagaagat 
ctggggaact 
tagtttactt 
nnnnnnnnnn 
nnnnnnnnnn 
caccggcagt 
aaaccaatag 
gaatagaaga 
gtaagtgtaa 
accaatataa 
atttaaaaat 
tggaatgcag 
ctcccgcctc 
agaaactcac 
ccttacaaac 
gcctgcagaa 
ctactggcaa 
nnnnnnnnnn 
nngacttacg 
tcacagtctg 
ctgccttctc 
ggagagggcg 
tctcgagagc 
aaaactgaag 
aatttcttct 
gcccagagaa 
gtggacctta 
cccaacacac 
acatagacac 



ttagttcatg 
actaataaat 
gattctgtca 
aaattcacac 
gacctcactt 
gaacacccca 
ctctctttcc 
tcagctgnnn 
nnnnnnnnnn 
actctagttg 
tttctaaatg 
agacatgtga 
atgcacatgt 
acagttacac 
tacctaattt 
cttaaggtag 
atcattttaa 
tttgaaagaa 
taagaaaaga 
tttcaaacat 
ctgttttaan 
nnnnnnnnnn 
ncaaaaaacc 
ataggaacta 
ctgtttattg 
cagtattaca 
agcaaatgta 
aaatagacaa 
aaagaaaaag 
cctgtcctct 
aaaaaaaaaa 
cacaacaaag 
tgtgagaggt 
aggagcttat 
nnnnnnnnnn 
nnnnnnnnng 
atagtgctgt 
taaagataaa 
ggaaaactat 
atgctctaag 
aagcaaacaa 
aaataaataa 
tggtacaatc 
atattctcaa 
tttaaatatg 
actaggacca 
caaggaagat 
ggctgtgggg 
nnnnnnnnnn 
ttgcagaaat 
ttagcaccat 
catcctgccc 
aaagctcaga 
gcgccggctc 
aggacaacgg 
attttcgttc 
acctatggat 
aggtttgacc 
acatatacac 
acacacgcac 



tgctgccaag 
ccacaaaaat 
ctctcacccc 
tctttgccat 
tattcctcct 
gacttgttct 
ctagagcttc 
nnnnnnnnnn 
nnnnnnnnnn 
acttaagaac 
taagaatcta 
ccgccactgt 
acaaagcaat 
caagagctga 
tcccctgttc 
aagttcctga 
tagcctgaat 
tttctgtatc 
aaaaattctt 
ccaagaaaat 
nnnnnnnnnn 
nnnnnnnnnn 
ttacactttg 
tcctttcaag 
cctttataaa 
ataggaaaaa 
agaactccag 
ttagttccca 
acagctacca 
aacctttagc 
aaaaaaaaaa 
accccgatac 
ggaagtacac 
tcaaaaactg 
nnnnnnnnnn 
aactacgtgn 
aaagctttta 
atcataatac 
caagatggta 
cactcccatt 
ggaaacaaaa 
ataaactttt 
atagctcact 
agcacggaga 
aagtcccagg 
aagaaaagca 
gaccagagtt 
annnnnnnnn 
nnnnnnnnnn 
ctncatttga 
tggatatggc 
caactgtagc 
agcacagtct 
ccagggctct 
gctgtttctc 
ttataattta 
ttaatcatca 
agaaaacacc 
tcatacacac 
acacacatat 



51061 
51121 
51181 
51241 
51301 
51361 
51421 
51481 
51541 
51601 
51661 
51721 
51781 
51841 
51901 
51961 
52021 
52081 
52141 
52201 
52261 
52321 
52381 
52441 
52501 
52561 
52621 
52681 
52741 
52801 
52861 
52921 
52981 
53041 
53101 
53161 
53221 
53281 
53341 
53401 
53461 
53521 
53581 
53641 
53701 
53761 
53821 
53881 
53941 
54001 
54061 
54121 
54181 
54241 
54301 
54361 



acacacacac 
acacacaagt 
nnnnnnnnnn 
tacggtcntg 
actcttcctg 
tagaaaggag 
ttatgtgaca 
ttgaggccca 
cgatcacggc 
ctcggcgccg 
cctttccccc 
atcgttgttt 
ccgtccttcc 
tgatattgcc 
tccctcccgt 
cccccccccc 
nnnnnnnnnn 
ngactgagtt 
ctggcatcac 
tttcaatttt 
ccttggcatt 
gaatatgttg 
tcgtatttac 
gtgattcatg 
gaaaccaagg 
agcagagcta 
cagagttaag 
cacacagaaa 
aggcccttgg 
nnnnnnnnnn 
nnnnnnnnnn 
ttgccctgca 
aaaaaaaaga 
gatgtatgtg 
gagccacagc 
aatacccccc 
agcttctcat 
cagaaatctc 
aatggctgtg 
tcatcctctc 
tctagttaag 
tgatgggata 
nnnnnnnnnn 
nnnnnnnnnn 
tgcgttagga 
ctcctaaaca 
ttagaataaa 
gttttgtttg 
aaacatacaa 
tggcaggcat 
actgtggccc 
tggcgttttt 
tggttgcgtg 
ggttggcccg 
cccggccccc 
ttttcgtcgc 



atatacacgc 
acaccnnnnn 
nnnnnnnnnn 
aggtgtcgat 
tggctaaagc 
cagcaggtcc 
ctagcaacag 
tacaattgac 
tactggagga 
cccaagttcg 
acccggttta 
ttaccacgtc 
ccctttctcc 
ccttttctct 
ttttcccttt 
catttttccc 
nnnnnnnnnn 
gcagagtctn 
attatgttct 
caaataagct 
tacacataaa 
agttaagaag 
atcacagctc 
ctcgctcagg 
ccagcacaat 
gatacagaga 
actaatgctg 
tataacctgg 
gggggaggcg 
nnnnnnnnnn 
nnnnnnnnnn 
ctccagcctg 
agtggtttat 
tttaaaggtc 
taggagaaga 
accaagatct 
cctcaactta 
caggaagaac 
cagtgggagg 
ctgtcatttc 
gcaattaact 
tgaattatac 
nnnnnnnnnn 
nnnnnnnnnn 
gtaagctaac 
ttggcatata 
agaaagtaaa 
agatggatgt 
ggagaataca 
ctttgtatgt 
ctggggtgtc 
cagcggcccg 
tcgtcccgcc 
ccccccggtg 
gtatccgtgc 
ctttttttgc 



acacacttac 
nnnnnnnnnn 
nnnnnnnnnn 
tcagtctgna 
agggatgcaa 
caggaatggg 
ccctgggtgg 
attcaaaccc 
ctacccgagg 
acattcagct 
ctattcgcat 
tcccactccc 
ccctttttcc 
cccccctccc 
ctttattccg 
cnnnnnnnnn 
nnnnnnnnnn 
agctcngctc 
ctttatgtgt 
tactcggttt 
acactaacaa 
cacaatacca 
tgagaacaaa 
ccgaggcaga 
attcctacta 
gagagagaga 
gcactataat 
gttgtggatc 
aggggcggag 
nnnnnnnnnn 
nnngaactag 
ggtaacagag 
acttggcaga 
agtcaatgag 
ttaaatgaaa 
cagtcacttt 
gatggtgcca 
tccaccaaga 
atgaacgcat 
tggtcaccac 
agtgaattaa 
ctcaataaag 
nnnnnnnnnn 
nnnnnnnnnn 
ttcttaaaat 
taaattaggg 
cagatacaga 
gcttctgtta 
gtataagcaa 
tttcttatta 
gcgcgtacct 
agggtgtcta 
cgcgtctttc 
tttgttgccg 
cccccgcttc 
cgggcccggt 



ccactcacac 
nnnnnnnnnn 
nnnnnnnnnn 
gntaccccta 
tagagaggca 
tgttgttttg 
tttttttatt 
tcaccttgaa 
gtgagtttgt 
tgatcccccg 
ttcccttggt 
tttttccccg 
tttccggttc 
ccttgcccat 
ccccattcct 
nnnnnnnnnn 
nnnnnnnnnn 
tnanagaata 
aaatatattt 
ataattaaat 
ttgtgtaaag 
gagaaaggga 
agcagtacag 
gtgagggagc 
gatggatgga 
aagagagaga 
ccag.taaaca 
tcggtgaggc 
ccgtgggttc 
nnnnnnnnnn 
tgctgaatct 
tgagacactg 
aaggatttca 
tagtagagct 
aaaattccca 
ccaaatcaag 
gagatcccaa 
cgaatgcctg 
tcatcagcat 
caaagaagtc 
ccattgaatt 
ctgcatatat 
nnnnnnnnnn 
nnnnnnnnnn 
tttaaatgta 
gaacagagac 
atgaatagct 
acattctatg 
aacatacaag 
aagtgtgtgg 
tcccgtccgc 
gtttgctgcg 
cctcattccc 
ttttgggttg 
tcctattcgc 
tgggcccg 



acatacaggc 
nnnnnnnnnn 
nnnnnnnnnn 
caaccatgct 
gctgattgca 
tttgtaaagc 
gaacacacat 
gaactcaact 
tgatcgcccg 
accagagccc 
ttcccctgtc 
tgatccctcc 
acgtttttcg 
cctccttccc 
cacctttcca 
nnnnnnnnnn 
nnnnnnnnnn 
cccctcatct 
aggagcaaat 
ttctcagagc 
attcttaaga 
aaattaaaac 
cagagcacac 
cataggggag 
gggagatggg 
gagagagaca 
aaaatacatc 
cagaccattg 
gcgnnnnnnn 
nnnnnnnnnn 
acctcggtgt 
tctcaaaaaa 
aatagggagc 
gttttgagct 
aaaaagaaaa 
atcccccaca 
tcaagatcat 
ttctacccag 
tcttgtccag 
ttgcaaaagg 
gcacactaaa 
actgcgcctg 
nnnnnnnnnn 
gaactaggtc 
aacatactta 
atgaaaacag 
taccaagttt 
caaagaatac 
atggaaaggc 
ctctgctgag 
gcccccggtt 
tggcacggtc 
cgtgttttgt 
cctgtgtcgc 
tttgacctgc 



acatgcatac 
nnnnnnnnnn 
nnnnnngacc 
gatacaaagc 
ggtcagagac 
cattcccagg 
cgagatttat 
cattatactt 
cttcggcctc 
ggttccggtc 
ccctgtccct 
gcccaccccc 
ccccccatat 
ccccatgcct 
cccccatttt 
nnnnnnnnnn 
nnnnnnnnnn 
ctcttggggt 
caaataccac 
agtttgtcaa 
gatttgttta 
actgaagtag 
cccagcagaa 
agagagcatg 
aagagaggga 
gagaaggaga 
cttattctct 
actgcgccgc 
nnnnnnnnnn 
nnnnnnnnnn 
anagaaaacc 
aaaaaaaaaa 
ttccttcaca 
caagaacatg 
gaaaaaggaa 
catcttgctc 
atctagaagg 
gcctaggaga 
gagtagctca 
acatctagtc 
cgggtgaatt 
tatgtgcggc 
nnnnnnnnnn 
agcagctgca 
ttatttgtaa 
aatagattaa 
ttttttatat 
agtataagca 
agaaggcagc 
gggaagcggg 
ctctctccgt 
tcccggttcc 
tacgctagtc 
gttcccgccc 
gatccccccc 



=> 



L2 ANSWER 1 OF 1 



GENBANK.RTM. COPYRIGHT 2001 



LOCUS (LOC) : 
GenBank ACC. NO. (GBN) 
CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL) : 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE) : 
DEFINITION (DEF) : 
SOURCE: 

ORGANISM (ORGN) : 



Catarrhini ; 

NUCLEIC ACID COUNT (NA) 
REFERENCE: 

AUTHOR (AU) : 

TITLE (TI) : 

JOURNAL (SO) : 



OTHER SOURCE (OS) 
REFERENCE: 

AUTHOR (AU) : 

TITLE (TI) : 



JOURNAL (SO) : 



AB014548 GenBank (R) 

AB014548 

211153-78-5 
5177 

mRNA; linear 
Primates 
6 Feb 1999 

Homo sapiens mRNA for KIAA0648 protein, partial cds . 
Homo sapiens adult male brain cDNA to mRNA, 
clone_lib:pBluescriptII SK plus clone : HJ03994 . 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi / Mammalia; Eutheria; Primates; 

Hominidae; Homo 

: 1690 a 931 c 1037 g 1519 t 

1 (bases 1 to 5177) 

Ohara,0.; Suyama,M.; Nagase,T.; Ishikawa^K. 
Direct Submission 

Submitted (26-MAY-1998 ) to the DDBJ/EMBL/GenBank 
databases. Osamu Ohara, Kazusa DNA Research Institute, 
Laboratory of DNA Technology; Yana 1532-3, Kisarazu, 
Chiba 292-0812, Japan (E-mail : cdnainfoi? kazusa . or . jp, 
Tel: +81-4 38-52-3913, Fax:+81-438-52-3914) 
CA 129:171313 

2 (sites) 

Ishikawa,K.; Nagase,T.; Suyama,M.; Miyajima,N.; 
Tanaka,A.; Kotani,H.; Nomura, N.; Ohara,0. 
Prediction of the coding sequences of unidentified 
human genes. X. The complete sequences of 100 new cDNA 
clones from brain which can code for large proteins in 
vitro 

DNA Res., 5 (3), 169-176 (1998) 



FEATURES (FEAT) : 
Feature Key 



Location 
1. .5177 



Qualifier 



source 



gene 
CDS 



1. .2558 
<1. .255E 



/organism="Homo sapiens" 
/db-xref="taxon: 9606" 
/clone="HJ03994" 

/clone-lib="pBluescriptII SK plus" 

/dev-stage=" adult" 

/sex="male" 

/ t issue- type= "brain" 

/gene="KIAA0648" 

/gene="KIAA0648" 

/codon-start=3 

/product="KIAA064 8 protein" 

/protein-id="BAA31623. 1" 

/db-xref="GI : 3327110" 

/translation="LYYLYASLDPNAVKALNEMW 

KCQNMLRSHVRELLDLHKQPTSEA 

NCSAMFGKLMTIAKNLPDPGKAQDFVKKFNQVLG 

DDEKLRSQLELLISPTCSCKQADI 

CVREIARKLANPKQPTNPFLEMVKFLLERIAPVH 

I DSEAI S ALVKLMNKS I EGTADDE 

EEGVSPDTAIRSGLELLKVLSFTHPTSFHSAETY 



ESLLQCLRMEDDKVAEAAIQI FRN 

TGHKIETDLPQIRSTLIPILHQKAKRGTPHQAKQ 

AVHCIHAIFTNKEVQLAQIFEPLS 

RSLNAD VPEQL I T PLVS LGH I SMLAPDQFAS PMK 

SVVANFIVKDLLMNDRSTGEKNGK 

LWSPDEEVSPEVLAKVQAIKLLVRWLLGMKNNQS 

KSANSTLRLLSAMLVSEGDLTEQK 

RI SKS DMSRLRLAAGS AIMKLAQE PCYHEI I T PE 

QFQLCALVINDECYQVRQIFAQKL 

HKALVKLLLPLEYMAI FALCAKDPVKERRAHARQ 

CLLKNISIRREYIKQNPiyiATEKLL 

SLLPEYVVPYMIHLLAHDPDFTRSQDVDQLRDIK 

ECLWFMLEVLMTKNENNSHAFMKK 

MAENIKLTRDAQSPDESKTNEKLYTVCDVALCVI 

NSKSALCNADSPKDPVLPMKFFTQ 

PEKDFCNDKSYISEETRVLLLTGKPKPAGVLGAV 

NKPLSATGRKPYVRSTGTETGSNI 

NVNSELNPSTGNRSREQSSEAAETGVSENEENPV 

RI I S VT PVKN I DPVKNKE I NS DQA 

TQGNISSDRGKKRTVTAAGAENIQQKTDEKVDES 

GPPAPSKPRRGRRPKSESQGNATK 

NDDLNKPINKGRKRAAVGQESPGGLEAGNAKAPK 

LQDLAKKAAPAERQIDLQR" 

SEQUENCE (SEQ) : 

1 gcttatatta cttatatgct agtttggatc caaatgctgt aaaagctctc aacgaaatgt 
61 ggaagtgtca gaacatgctt cggagccatg tacgcgaact attggatttg cacaagcagc 
121 ctacatcaga ggctaactgt tctgccatgt ttggaaaact gatgaccata gcaaagaatt 
181 tgcctgaccc cgggaaagca caagattttg tgaagaaatt taaccaggtt ctcggcgatg 
241 atgagaaact tcggtctcag ttggagttat taattagccc aacctgttct tgcaaacaag 
301 cagatatttg tgtgagagaa atagcccgga aacttgcaaa tcctaagcaa ccaacaaatc 
361 cttttctaga gatggtcaaa tttctgttgg aaagaatcgc acctgtgcac attgattcag 
421 aagccataag tgcactagtg aaattgatga ataagtcaat agaggggaca gcagatgatg 
481 aagaggaggg tgtaagtcca gatacagcta tccgttcagg acttgaactt cttaaggttc 
541 tgtcttttac acatcctacc tcgttccact ctgcagagac atatgagtcc ttgttacagt 
601 gcctaagaat ggaggatgac aaggtagcag aagctgctat tcaaattttt agaaatacag 
661 gtcacaaaat agaaacagac cttccccaga tacgatcgac cttaattccc attttacatc 
721 aaaaagcaaa gaggggtact ccacaccaag caaaacaggc tgtgcactgt atacacgcca 
781 tattcacaaa taaagaagtc cagcttgcac agatttttga gccactcagt aggagtctga 
841 atgctgatgt gccagaacaa cttataactc cattagtttc attgggccac atttctatgt 
901 tagcaccaga tcagtttgct tccccaatga aatctgtagt agcaaatttt attgtgaaag 
961 atctgctaat gaatgacagg tcaacaggtg aaaagaatgg aaaactgtgg tctccagatg 
1021 aagaggtttc ccctgaagta ctagcaaagg tacaggcaat taaacttctg gtaaggtggc 
1081 tgttgggtat gaaaaacaac cagtctaaat ctgccaattc aacccttcgg ttattatcag 
1141 cgatgttggt tagtgagggt gacctgacag agcaaaagag gatcagtaaa tctgatatgt 
1201 ctcgcttgcg attagctgct ggtagtgcca taatgaagct tgctcaggaa ccttgttacc 
1261 atgaaattat taccccagaa cagtttcagc tctgtgcact tgttattaat gatgagtgtt 
1321 accaagtaag gcagatattt gctcagaagc tgcataaggc acttgtgaag ttactgctcc 
1381 cattggagta tatggcgatc tttgccttgt gtgccaaaga tcctgtgaag gagagaagag 
1441 cacacgcacg acaatgttta ctgaaaaata tcagtatacg cagggaatac attaagcaga 
1501 atcctatggc tactgagaaa ttattatcac tgttgcctga atatgtagtt ccatacatga 
1561 ttcacctgct agcccatgat ccagatttta caagatcaca agatgttgat cagcttcgtg 
1621 atatcaaaga gtgcctatgg ttcatgcttg aagttttaat gacaaagaat gaaaacaata 
1681 gccatgcctt tatgaagaag atggcagaga acatcaagtt aaccagagat gcccagtctc 
1741 cagatgaatc caagacaaat gaaaaactgt atacagtatg tgatgtggct ctctgtgtta 
1801 taaatagtaa aagtgctttg tgcaatgcag attcaccaaa ggacccagtc ctcccaatga 
1861 aattttttac acaacctgaa aaggacttct gtaacgataa gagttatatt tcagaagaga 
1921 caagagtact tctgttaaca ggaaagccaa agcctgctgg agtactaggt gcagtaaata 
1981 agcctttatc agcaacggga aggaaaccct atgttagaag cactggcact gagactggaa 
2041 gcaatattaa tgtaaattca gagctgaacc cttcaaccgg aaatcgatca agggaacaga 



2101 gttcagaggc agcagaaact ggagttagtg aaaatgaaga gaaccctgtg aggattattt 

2161 cagtcacacc tgtaaagaat attgacccag taaagaataa ggaaattaat tctgatcagg 

2221 ctacccaggg caacatcagc agtgaccgag gaaagaaaag aacagtaaca gcagctggtg 

2281 cagagaatat ccaacaaaaa acagatgaga aagtagatga atcgggacct cccgcccctt 

2341 ccaaacccag gagaggacgt cgacccaagt ctgaatctca gggcaatgct accaaaaatg 

24 01 atgatctaaa taaacctatt aacaagggaa ggaagagagc tgcagtgggt caggagagcc 

24 61 ctgggggttt ggaagcaggt aatgccaaag cacccaaact gcaagattta gccaaaaagg 

2521 cagcaccagc agaaagacaa attgacttac aaaggtaaaa atgcatttgc aaagggagaa 

2581 aatgaaggcc aaacagaagc aggctccagc ttctgcaaaa acttggattc acaaatgtcc 

2641 ctgaacagaa aatgaagctc acttcagaac acacactctc tgccttgaaa actaaagaga 

2701 ctattacttc cttttcacat gaccacaagt cctctgatgg aaatgtacag cagaaactct 

2761 tgagagagag gctaaaagca actctgttct cccccttccc ctagactttt cttacgaaaa 

2821 gtcaataatt aagcaaattg cttaacactt ggttccagtt cctgcctatc tggagtttaa 

2881 atgcgtaata caccattaat ttccacgctg cagtttttat tttaaagaaa gtaacaagat 

2941 gtctttacac tgacactgaa aattcatcca ttttagagcc aggaattccc atgttacaca 

3001 ggaaaaaata gaagtctact gaattaattt tttaaaagaa aagagatcag attaaatatt 

3061 tctttgtttt tccttttgga aacttttatg tataattctt tctgcctgcc tacttttctg 

3121 caaaaatgag atgtacagat ttcggttccc tgctatgaaa agtgatgtgg tagcaatttt 

3181 ataaatgttg ctttctgatt tttatcagag tgagaaaatt aaaattattg atttgcaagt 

3241 agtaaacagt tcatattttg atttcccctc attttagttt aatataattt gcaataaatg 

3301 tacatattgt tgtttgtttc ataaagcata tcactttaaa atggttttta ctcctgtgat 

3361 tatgttggaa tatttggaat tttaaaggag taaagactgt ccagcatttg gttttataat 

3421 gtttgtcacc agatttttat taatgtaaaa aatcaatttt taaaaaatag ttggactttg 

3481 gcagctttta aggaaagttg gaggtgtttt aggattgcta tcaattttca gcattgtgct 

3541 atttggaaat aagtgttttg cttttgtctg atggtctggg ctcattttta tgtttatttt 

3601 agaaaactgt tgcatcaata tattatgttt cttggcattg ttcagcatag gtaatgtgtg 

3661 cactttatgt gtacacataa tcatatttaa gttttttgca taaaataaat gcttctagat 

3721 gtcatggcag tctttttaat ctttttatca tatgctttct tgtgaatttt ttcatgttaa 

3781 agagctaaag tcataacatg attacagtca actctccatt atctatataa aatagtgact 

3841 aagcctcagg tttttaattt tgtgataaca aaataacgaa ggcatgtaag acctgattct 

3901 ggaggaacat gaaatttgtc ttttctcatg tccagagttc tatcctgccc ccactgtcca 

3961 ctgtagggtc atccgcaaag ccctagcaga atgtgctcac tccatttcct tacacgtttc 

4021 tagcatgggt cagaggaaac aacatttgtg ttataacttc gtcttgatag gctgtagtgt 

4081 acatgggatg taaaacaaac aagtgtatca aaggtggatg attctgttag agtgaagttt 

4141 gagagtaaat gtcacttacg tttctcatag atgatcaaga gttggctgtg tattgactga 

4201 aagatgggta attattttaa atatgcattt acacacattt aggtatcaga agatgcttag 

4261 ggaacaatgg ataccaatga tagaaaatga tacctttaca ggggcagaaa aatccccact 

4321 cttccttatt gcctcttcag aaccctttag aaagtataaa atattgcctc caacatgctg 

4381 aaaaagagta tctatgcata agtatcagag aagtccctca agcaatcagt aggtgtgttc 

4441 tatttagaga gagtttaaag ttctcttagc atcagacaac ttgattccta aggtttccag 

4501 tgtgtcacca acaaaaagtg cattgatagg gacctttgtc tcttcctccc tttgattaat 

4561 tgcccggcat cacagtttac tagattacca agtgttacat catattaaat aaaatgtagc 

4 621 agaaccatct gcatcaatat attcctgttt agatttttgc aggagagaag ttaaaaggat 

4 681 ttgctccttg tatgatgtaa gtggcccacc ccaattttgt aacatgatgc aagtgtctgg 

4741 cactaaggga agcaagagta gggttgtgga aagaccaagc tgatggggag ggacttgttt 

4801 acgggaattt ttttagtttt ccttttcaaa ggaaaacatt aaaatccctt aggaatttgg 

4861 tattcacatc tcagagaact acaacacaaa agtgcagact tatatttgag aattaatgtt 

4 921 aaccctttgt gtctagtttg aagcttcttg tatttgtcta aaacaacaag ccagaatttt 

4 981 gtatctcctt tgataaaaag tgtgtataat gtaaagtagt tttgcatatt cttgtgctgc 

5041 acatgggctg aatttttaaa ttttttttaa aaacttgaag cagaaccttg taatttgtgt 

5101 aaatgacaag tgtaaaatcc taccataaaa tgctaaaaat atgcactgtt tcaaataaaa 
5161 ccaagaaatg cagcatt 



=> 
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LOCUS (LOC) : 
GenBank ACC. NO. (GEN) 
CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL) : 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE) : 
DEFINITION (DEF) : 
SOURCE: 

ORGANISM (ORGN) : 



Catarrhini ; 

NUCLEIC ACID COUNT (NA) 
REFERENCE: 

AUTHOR (AU) : 



TITLE (TI] 



in 

JOURNAL (SO) : 
OTHER SOURCE (OS) : 
REFERENCE: 

AUTHOR (AU) : 
TITLE (TI) : 
JOURNAL (SO) : 



AB023196 GenBank (R) 

AB023196 

222526-42-3 

5110 

mRNA; linear 
Primates 
16 Jun 1999 

Homo sapiens mRNA for KIAA0979 protein, partial cds . 
Homo sapiens adult male brain cDNA to mRNA, 
clone_lib:pBluescriptII SK plus clone : hj 07056 . 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi; Mammalia; Eutheria; Primates; 

Hominidae; Homo 

: 1714 a 919 c 1045 g 1432 t 

1 (sites) 

Nagase,T.; Ishikawa,K.; Suyama,M.; Kikuno,R.; 
Hirosawa,M.; Miyajima,N.; Tanaka,A. ; Kotani,H.; 
Nomura, N. ; Ohara, O. 

Prediction of the coding sequences of unidentified 
human genes. XIII. The complete sequences of 100 new 
cDNA clones from brain which code for large proteins 

vitro 

DNA Res. , 6 (1) , 63-70 (1999) 
CA 131:28520 

2 (bases 1 to 5110) 

Ohara, 0.; Nagase,T.; Kikuno,R. 
Direct Submission 

Submitted (04-FEB-1999) to the DDB J/EMBL/GenBank 
databases. Osamu Ohara, Kazusa DNA Research Institute, 
Laboratory of DNA Technology; Yana 1532-3, Kisarazu, 
Chiba 292-0812, Japan (E-mail : cdnainfo@kazusa . or . jp, 
Tel: +81-4 38-52-3913, Fax:+81-4 38-52-3914) 



FEATURES (FEAT) : 
Feature Key 

source 1 . .5110 



Location 



Qualifier 



gene 
CDS 



1. .4253 
<1. .4253 



/organism="Homo sapiens" 
/db-xref ="taxon : 9606" 
/clone-"hj07056" 

/clone-lib="pBluescriptII SK plus" 

/dev-stage=" adult " 

/sex="male" 

/tis sue- type= "brain" 

/gene="KIAA0979" 

/gene="KIAA0979" 

/codon-start=3 

/product="KIAA0979 protein" 

/protein-id="BAA76823 . 1" 

/db-xref="GI : 4589602" 

/translation-"VRRLKMVVKTFMDMDQDSEE 

EKELYLNLALHLASDFFLKHPDKD 

VRLLVACCLADIFRIYAPEAPYTSPDKLKDIFMF 

ITRQLKGLEDTKSPQFNRYFYLLE 

NIAWVKSYNICFELEDSNEIFTQLYRTLFSVINN 



GHNQKVHMHMVDLMSSI ICEGDTV 

SQELLDTVLVNLVPAHKNLNKQAYDLAKALLKRT 

AQAIEPYITNFFNQVLMLGKTSIS 

DLSEHVFDLILELYNIDSHLLLSVLPQLEFKLKS 

NDNEERLQVVKLLAKMFGAKDSEL 

ASQNKPLWQCYLGRFNDIHVPIRLECVKFASHCL 

MNHPDLAKDLTEYLKVRSHDPEEA 

IRHDVIVSIVTAAKKDILLVNDHLLNFVRERTLD 

KRWRVRKEAMMGLAQI YKKYALQS 

AAGKDAAKQIAWIKDKLLHIYYQNSIDDRLLVER 

I FAQYMVPHNLETTERMKCLYYLY 

ATLDLNAVKALNEMWKCQNLLRHQVKDLLDLIKQ 

PKTDASVKAIFSKVMVITRNLPDP 

GKAQDFMKKFTQVLEDDEKIRKQLEVLVSPTCSC 

KQAEGCVREITKKLGNPKQPTNPF 

LEMI KFLLERI APVH I DTES I SALI KQVNKS I DG 

TADDEDEGVPTDQAIRAGLELLKV 

LSFTHPISFHSAETFESLLACLKMDDEKVAEAAL 

QIFKNTGSKIEEDFPHIRSALLPV 

LHHKSKKGPPRQAKYAIHCIHAIFSSKETQFAQI 

FEPLHKSLDPSNLEHLITPLVTIG 

HIALLAPDQFAAPLKSLVATFIVKDLLMNDRLPG 

KKTTKLWVPDEEVSPETMVKIQAI 

KMMVRWLLGMKNNHSKSGTSTLRLLTTILHSDGD 

LTEQGKISKPDMSRLRLAAGSAIV 

KLAQEPCYHEIITLEQYQLCALAINDECYQVRQV 

FAQKLHKGLSRLRLPLEYMAICAL 

CAKDPVKERRAHARQCLVKNINVRREYLKQHAAV 

SEKLLSLLPEYVVPYTIHLLAHDP 

DYVKVQDIEQLKDVKECLWFVLEILMAKNENNSH 

AFIRKMVENIKQTKDAQGPDDAKM 

NEKLYTVCDVAMNIIMSKSTTYSLESPKDPVLPA 

RFFTQPDKNFSNTKNYLPPEMKSF 

FTPGKPKTTNVLGAVNKPLSSAGKQSQTKSSRME 

TVSNASSSSNPSSPGRIKGRLDSS 

EMDHSENEDYTMSSPLPGKKSDKRDDSDLVRSEL 

EKPRGRKKTPVTEQEEKLGMDDLT 

KLVQEQKPKGSQRSRKRGHTASESDEQQWPEEKR 

LKEDILENEDEQNSPPKKGKRGRP 

PKPLGGGTPKEEPTMKTSKKGSKKKSGPPAPEEE 

EEEERQSGNTEQKSKSKQHRVSRR 

AQQRAESPESSAIESTQSTPQKGRGRPSKTPSPS 

QPKKNVRVGRSKQAATKENDSSEE 

VDVFQGSSPVDDIPQEETEEEEVSTVNVRRRSAK 

RERR" 



SEQUENCE (SEQ) : 

1 tggtgagacg 

61 aagaaaagga 

121 atcctgataa 

181 atgctcctga 

241 caagacagtt 

301 tacttgagaa 

361 atgaaatttt 

421 agaaagtcca 

481 tgtctcagga 

541 acaagcaagc 

601 catatattac 

661 tgtcagagca 

721 tctctgtttt 



attaaagatg 
gctttattta 
agatgttcgc 
agctccttac 
gaaggggcta 
cattgcttgg 
cacccagcta 
tatgcacatg 
gcttttggat 
atatgatttg 
caattttttt 
tgtctttgac 
accccagctt 



gttgtgaaaa 
aacctagctt 
ttactggtag 
acatcccctg 
gaggatacaa 
gtcaagtcat 
tacagaacct 
gtagacctta 
acggttttag 
gcaaaggctt 
aatcaggttc 
ttaattttgg 
gaatttaaat 



cttttatgga 
tacatcttgc 
cctgctgcct 
ataaactaaa 
agagcccaca 
ataacatatg 
tattttcagt 
tgagctctat 
taaatctggt 
tactgaagag 
tgatgcttgg 
agctctacaa 
taaagagcaa 



tatggaccag 
ttcagatttt 
tgctgatatt 
ggatatattt 
gttcaatagg 
ctttgagtta 
tataaacaat 
tatttgtgaa 
acctgctcat 
gacagctcaa 
gaaaacatct 
tattgatagt 
tgataatgag 



gactctgaag 
tttctcaagc 
ttcaggattt 
atgtttataa 
tatttttatt 
gaagatagca 
ggccacaatc 
ggtgatacag 
aagaatttaa 
gctattgagc 
atcagcgatt 
catttgctgc 
gagcgcctac 



781 aagttgttaa actactggca aaaatgtttg 
841 acaagccact ttggcagtgc tacttgggca 
901 tggaatgtgt gaaatttgct agccattgtc 
961 taacagagta tcttaaagtg aggtcacatg 
1021 ttgtgtcaat agttacagct gctaaaaagg 
1081 attttgtgag agagagaaca ttagacaaac 
1141 gacttgccca aatttataag aaatatgctt 
1201 aacagatagc atggatcaaa gacaaattgc 
1261 atcgactact tgttgaacgg atctttgctc 
1321 cagaacggat gaaatgctta tattacttgt 
1381 cattgaatga aatgtggaaa tgtcaaaatc 
1441 acttgattaa gcaacccaaa acagatgcca 
1501 ttattacaag aaatttacct gatcctggta 
1561 aggtgttaga agatgatgag aaaataagaa 
1621 gctcctgcaa gcaggctgaa ggttgtgtgc 
1681 aacagcctac aaatcctttc ctggaaatga 
1741 tgcacataga taccgaatct atcagtgctc 
1801 gaacagcaga tgatgaagat gagggtgttc 
1861 aactgcttaa ggtactctca tttacacatc 
1921 aatcattact ggcttgtctg aaaatggatg 
1981 ttttcaaaaa cacaggaagc aaaattgaag 
2041 ttcctgtttt acatcacaaa tctaaaaaag 
2101 attgtatcca tgcgatattt tctagtaaag 
2161 tgcataagag cctagatcca agcaacctgg 
2221 gtcatattgc tctccttgca cctgatcaat 
2281 ctttcattgt gaaagatctt ctcatgaatg 
2341 tttgggttcc agatgaagaa gtatctcctg 
2401 tgatggttcg atggctactt ggaatgaaaa 
24 61 taagattgct aacaacaata ttgcatagtg 
2521 gtaaaccaga tatgtcacgt ctgagacttg 
2581 aagaaccctg ttaccatgaa atcatcacat 
2641 tcaacgatga atgctatcaa gtaagacaag 
2701 cccgtttacg gcttccactt gagtatatgg 
2761 taaaggagag aagagctcat gctaggcaat 
2821 agtatctgaa gcagcatgca gctgttagtg 
2881 ttgttccata tacaattcac cttttggcac 
2941 ttgaacaact taaagatgtt aaagaatgtc 
3001 aaaatgaaaa taacagtcac gcttttatca 
3061 aagatgccca aggaccagat gatgcaaaaa 
3121 ttgccatgaa tatcatcatg tcaaagagta 
3181 cggtactacc agctcgtttc ttcactcaac 
3241 atctgcctcc tgaaatgaaa tcatttttca 
3301 taggagctgt taacaagcca ctttcatcag 
3361 gaatggaaac tgtaagcaat gcaagcagca 
3421 aggggaggct tgatagttct gaaatggatc 
3481 cacctttgcc ggggaaaaaa agtgacaaga 
3541 tggagaagcc tagaggcagg aaaaaaacgc 
3601 tggatgactt gactaagttg gtacaggaac 
3661 aaagaggcca tacggcttca gaatctgatg 
3721 aagaagatat attagaaaat gaagatgaac 
3781 gccgaccacc aaaacctctt ggtggaggta 
3841 ctaaaaaagg aagcaaaaaa aaatctggac 
3901 aaagacaaag tggaaatacg gaacagaagt 
3961 gagcacagca gagagcagaa tctcctgaat 
4021 cacagaaagg acgaggaaga ccatcaaaaa 
4081 tccgtgtagg acgctccaaa caagcagcta 
4141 atgtgtttca gggtagctct cctgtcgatg 
4201 aagtttctac agtaaatgta cggcggcgaa 
4261 taattaataa ctttctctgt gaaagctttg 
4321 caagcttgag gctgaataaa gcctttgatg 



gggcaaagga ttcagaattg gcttctcaaa 
ggtttaatga tatccatgta ccaatccgcc 
tcatgaacca tcctgattta gcaaaagact 
accctgagga agctattaga catgatgtta 
atattcttct ggtcaatgat cacttactta 
gatggagagt acgcaaagaa gccatgatgg 
tacagtcagc agctggaaaa gatgctgcaa 
tacatatata ttatcaaaat agtattgatg 
aatacatggt tcctcacaat ttagaaacta 
atgccacact ggatttaaat gctgtgaaag 
tgctccgaca tcaagtaaag gatttgcttg 
gtgtcaaggc catattttca aaagtgatgg 
aggctcagga tttcatgaag aaattcacac 
agcagttaga agtacttgtt agtccaacat 
gtgaaataac taagaagttg ggcaacccca 
tcaagtttct cttggagagg atagcacctg 
ttattaaaca agtgaacaaa tcaatagatg 
caactgatca agccatcaga gcaggtcttg 
ccatctcatt tcattctgct gaaacatttg 
atgaaaaagt agcagaagct gcactacaaa 
aggattttcc acacatcaga tcagccttgc 
gacccccccg tcaagccaaa tatgccattc 
agacccagtt tgcacagata tttgagcctc 
aacatctcat aacaccattg gttactattg 
ttgctgctcc tttgaaatct ttggtagcta 
atcggcttcc agggaaaaag acaactaaac 
agacaatggt caaaattcag gctattaaaa 
ataatcacag taaatcagga acttctacct 
atggagactt gacagaacag gggaaaatta 
ctgctgggag tgctattgtg aagctggcac 
tagaacaata tcagctatgt gcattagcta 
tgtttgccca gaaacttcac aaaggccttt 
caatctgtgc cctttgtgca aaagatcctg 
gtttggtgaa aaatataaat gtaaggcggg 
aaaaattatt gtctcttcta ccagagtatg 
atgacccaga ttatgtcaaa gtacaggata 
tttggtttgt tctggaaata ttaatggcta 
gaaagatggt agaaaatatt aaacaaacaa 
tgaatgaaaa actgtacact gtgtgtgatg 
ctacatacag tttggaatct cctaaagacc 
ctgacaagaa tttcagtaac accaaaaatt 
ctcctggaaa acctaaaaca accaatgttc 
caggcaagca atctcagacc aaatcatcac 
gctcaaatcc aagctctcct ggaagaataa 
acagtgaaaa tgaagattac acaatgtctt 
gagacgactc tgatcttgta aggtctgaat 
ccgtcacaga acaggaggag aaattaggta 
agaaacctaa aggcagtcag cgaagtcgga 
aacagcagtg gcctgaggaa aagaggctca 
agaatagtcc gccaaaaaag ggtaaaagag 
caccaaaaga agagccaaca atgaaaactt 
ctccagcacc agaggaggag gaagaagaag 
ccaaaagcaa acagcaccga gtgtcaagga 
ctagtgcaat tgaatccaca cagtccacac 
cgccatcacc atcacaacca aaaaaaaatg 
ctaaggaaaa tgattcaagt gaagaagtag 
atattccaca ggaagaaaca gaggaggagg 
gtgctaaaag ggaacggcga tgaacaaatg 
gaaaaatctt tttttttttt tttttttggt 
cacaaaatgg gactgctgaa gagtggacag 



4381 
4441 
4501 
4561 
4621 
4681 
4741 
4801 
4861 
4921 
4981 
5041 
5101 



ttggacctta 
taacattgac 
agaaacttgt 
tagcttttat 
gaaaagaatt 
tctgcaattt 
cactatttgt 
cctgggcacc 
tctgatagga 
cacctcagta 
aagaaagatt 
tttatcccat 
aatttgaatg 



ctttggtgac 
tatggagtct 
aaatatcttt 
ctgcggcttt 
tgttaggata 
tactgtgaaa 
tggagagtta 
cttaatcttc 
ggacaaccat 
ttgatttgtc 
taaagtattt 
tttatcatcc 



cccatacgtt 
tgtgaaagtg 
tttctttttt 
aaactgacag 
gatcttaagc 
aaaaatttgt 
aatggtctct 
agaggtgcta 
gcaaattgtg 
ccagaatttt 
taattttaaa 
tttcagtttt 



tgtggtcaca 
taatgtgcga 
ttaatgtttc 
tacccgactg 
agtaatctgt 
tttcaacaat 
tccctttgtg 
aattgtctgc 
aaatagtcct 
ctggcctttc 
gagtgtgtta 
tattaatcta 



tgctttagcc 
tggctatgta 
tgatttctga 
tttattggat 
cagtgtttgt 
tggtgtcatt 
tatcttacct 
cattacacca 
gaagttcttg 
atggcaatga 
taaaataatg 
ctgtatcaat 



atacacatgg 
gacataaaga 
agtgcttgta 
ctattgattt 
atttgtattc 
ttcttgatgt 
agtgtttact 
gaaggatgcc 
gattacttta 
aaattttaag 
tactgaattc 
aaaattctgt 



=> 
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GENBANK.RTM. COPYRIGHT 2001 



LOCUS (LOG) : 
GenBank ACC. NO. (GBN) : 
CAS REGISTRY NO. (RN) : 
SEQUENCE LENGTH (SQL) : 
MOLECULE TYPE (CI) : 
DIVISION CODE (CI) : 
DATE (DATE) : 
DEFINITION (DEF) : 
SOURCE: 
ORGANISM (ORGN) : 



Catarrhini ; 

NUCLEIC ACID COUNT (NA) 
REFERENCE : 

AUTHOR (AU) : 



Thakur, S . ; 
Wong, A.K.C. ; 

TITLE (TI) : 

JOURNAL (SO) : 
OTHER SOURCE (OS) 
REFERENCE: 

AUTHOR (AU) : 
TITLE (TI) : 
JOURNAL (SO) : 



HSU50533 GenBank (R) 

U50533 

180884-10-0 

1852 

mRNA; linear 
Primates 
27 Nov 1996 

Human BRCA2 region, mRNA sequence CG008 . 
human . 

Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; 
Euteleostomi; Mammalia; Eutheria; Primates; 

Hominidae; Homo 

: 609 a 348 c 379 g 505 t 11 others 

1 (bases 1 to 1852) 

Couch, F.J,; Rommens, J. M. ; Neuhausen, S . L . ; Belanger,C.; 
Dumont,M.; Kenneth, A.; Bell,R.; Berry, S.; Bogden,R.; 
Cannon-Albright, L. ; Farid, L. ; Frye, C. ; Hattier, T. ; 
Janecki,T.; Jiang, P.; Kehrer,R.; Leblanc, J. -F. ; 
McArthur-Morrison, J, ; McSweeney, D. ; Miki, Y. ; Peng, Y. ; 
Samson, C. ; Schroeder,M. ; Snyder, S.C. ; Stringf ellow, M. ; 
Stroup,C.; Swedlund,B.; Swensen,J.; Teng,D.; 

Tran, T . ; Tranchant, M. ; Welver-Feldhaus , J. ; 

Shizuya, H . ; Labrie, F. ; Skolnick, M. H . ; Goldgar, D. E . ; 
Kamb,A.; Weber, B.L.; Tavtigian, S . V . ; Simard,J. 
Generation of an integrated transcription map of the 
BRCA2 region on chromosome 13ql2-ql3 
Genomics, 36 (1), 86-99 (1996) 
CA 125:187182 

2 (bases 1 to 1852) 
Simard, J. 

Direct Submission 

Submitted ( 04-MAR-1996) Jacques Simard, Laboratory of 
Molecular Endocrinology, CHUL Research Center, 2705, 
Boulevard Laurier, Quebec City, Quebec GIV 4G2, Canada 



FEATURES (FEAT) 
Feature Key 



Location 
1852 



Qualifier 



source 



/organism="Homo sapiens" 
/db-xref-"taxon: 9606" 
/chromosome="13" 
/map="13ql2-ql3" 
/note="CG008; DSEG numbers: 
D13S171 and D13S310" 



SEQUENCE (SEQ) : 

1 ttaaagtgag 

61 ttacagctgc 

121 agagaacatt 

181 tttataagaa 

241 ggatcaaaga 

301 ttgaacggat 

361 aatgcttata 



gtcacatgac 
taaaaaggat 
agacaaacga 
atatgcttta 
caaattgcta 
ctttgctcaa 
ttacttgtat 



cctgaggaag 
attcttctgg 
tggagagtac 
cagtcagcag 
catatatatt 
tacatggttc 
gccacactgg 



ctattagaca 
tcaatgatca 
gcaaagaagc 
ctggaaaaga 
atcaaaatag 
ctcacaattt 
atttaaatgc 



tgatgttatt 
cttacttaat 
catgatggga 
tgctgcaaaa 
tattgatgat 
agaaactaca 
tgtgaaagca 



gtgtcaatag 
tttgtgagag 
cttgcccaaa 
cagatagcat 
cgactacttg 
gaacggatga 
ttgaatgaaa 



421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 



tgtggaaatg 
aacccaaaac 
atttacctga 
atgatgagaa 
aggctgaagg 
atcctttcct 
ccgaatctat 
atgaagatga 
tactctcatt 
cttgtctgaa 
caggaagcaa 
atcacaaatc 
gcgatatttt 
ctagatccaa 
ctccttgcac 
aaagatcttc 
gatgaagaag 
tggctacttg 
aacaacaata 
gtcacgtctg 
ccatgaaatc 
ctatcaagta 
ttccacttga 
gagctcatgc 



tcaaaatctg 
agatgccagt 
tcctggtaag 
aataagaaag 
ttgtgtgcgt 
ggaaatgatc 
cagtgctctt 
gggtgttcca 
tacacatccc 
aatggatgat 
aattgaagag 
taaaaaagga 
ctagtaaaga 
gcaacctgga 
ctgatcaatt 
tcatgaatga 
tatctcctga 
gcatgtaamt 
tcgcatcgtg 
agacttgctg 
rtcacawtag 
agacaagtgt 
gaatatggca 
taggcaatgt 



ctccgacatc 
gtcaaggcca 
gctcaggatt 
cagttagaag 
gaaataacta 
aagtttctct 
attaaacaag 
actgatcaag 
atctcatttc 
gaaaaagtag 
gattttccac 
cccccccgct 
gacycagttt 
acatctcata 
tgctgctcct 
tcggcttcca 
gacaatggtc 
tamtcacagt 
atggggattg 
ctgggagtgc 
aacaatatca 
ttggcccaga 
atctgtgccc 
ttggtgaaaa 



aagtaaagga 
tattttcaaa 
tcatgaagaa 
tacttgttag 
agaagttggg 
tggagaggat 
tgaacaaatc 
ccatcagagc 
attctgctga 
cagaagctgc 
acatcagatc 
caagccaaat 
gcacagatat 
acaccattgg 
ttgaaatctt 
gggaaaaaga 
aaaattcagg 
atatcaggat 
gmagamcagg 
tattrtgaag 
gctatgtgcr 
aacttcgcaa 
tttgtgcaaa 
atataaatgt 



tttgcttgac 
agtgatggtt 
attcacacag 
tccaacatgc 
caaccccaaa 
agcacctgtg 
aatagatgga 
aggtcttgaa 
aacatttgaa 
actacaaatt 
agccttgctt 
atgccattca 
ttgagcctct 
ttactattgg 
tggtagctac 
caactaaact 
ctattaaaat 
cttcttcctt 
ggaaattagt 
ctggcacaag 
ttagctatca 
aggcctttcc 
agatcctgta 
aaagcgggac 



ttgattaagc 
attacaagaa 
gtgttagaag 
tcctgcaagc 
cagcctacaa 
cacatagata 
acagcagatg 
ctgcttaagg 
tcattactgg 
ttcaaaaaca 
cctgttttac 
ttgtatccat 
gcataagagc 
tcatattgct 
tttcattgyg 
ttgggttcca 
gatggttcga 
aagsattgct 
aaccagatat 
aaccctgtta 
acgatgaatg 
cgtttgcggc 
aaggagagaa 
tg 



